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Preface 


During the last decade of the last millennium, I taught a course on algebraic 
number theory, first at the University of Salzburg and the following year at 
Brigham Young University, for which I had prepared my own notes. They 
were inspired by two famous classics: (1) Alan Baker, Transcendental Number 
Theory and (2) Atiyah-MacDonald, Commutative Algebra. Both have cov- 
ered the essentials of the subjects in as few pages as possible. In 2003, my 
notes were published by Kendrick Press (a local publisher in Utah striving to 
make available rare books in mathematics, e.g., the first English translation of 
Riemann’s collected papers). This book is an expanded version of these notes. 
Thanks to the referees at whose suggestions now the first two chapters have 
been added for the convenience of the reader. Also, at their suggestion, the 
last chapter has been expanded considerably. I hope the book will be helpful 
at least to those who do not have time to plough through voluminous treatises 
but would like to know the basics of algebraic number theory. 


We have tried to keep the treatment of the subject (Chapters 3-9) as clas- 
sical as possible. In this form, it was developed originally by the German 
school, as summed up in Hilbert’s Zahlbericht. (See [24] in the Bibliography.) 
Later, the theory was recast, from a different and more abstract point of view. 
The newer approach, for which Weil’s Basic Number Theory [39] may be con- 
sulted, provides a broader theory to include the arithmetic of algebraic curves 
over finite fields, and even suggests a theory for studying higher dimensional 
varieties over finite fields. However, if one is not familiar with the classical 
algebraic number theory (of number fields), Weil’s book may be difficult to 
read. Commutative algebra also originated in algebraic number theory. The 
purpose of the last chapter is to indicate how the subject treated in this book 
leads naturally to the Weil conjectures and some delicate questions in alge- 
braic geometry. We shall discuss, without supplying complete details, some 
advantages of this approach to algebraic number theory. 


This book is suitable for an independent study or as a textbook for a small 
class. We present the subject as developed by its creators Dedekind, Dirichlet, 
Hilbert, Kronecker, Minkowski and Weber, etc. Besides a basic knowledge of 
elementary number theory and linear algebra, a first course in abstract algebra 
and in Galois theory (e.g., [40], [9], [23], and [1] or [31]) should more than 
suffice as a prerequisite. 


vii 


viii Preface 


I would like to thank Professor Darrin Doud for reading the final ver- 
sion of my notes published in 2003 and pointing out several misprints, and 
Professor Roger Baker for doing an outstanding job of proofreading. Thanks 
to Professor Wolfgang Herfort for thoroughly reading the current version of 
the book to point out errors and suggest improvements. I would also express 
my gratitude to Professor Fritz Schweiger for inviting me to the University of 
Salzburg. Last but not least, many thanks to Lonette Stoddard for her out- 
standing job with /Tpx. 


J. S. Chahal 
Provo, UTAH 
21 December 2020 
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Genesis: What Is Number Theory? 


1.1 What Is Number Theory? 


Number Theory is the study of numbers, in particular the whole numbers 
1,2,3,..., also called the natural numbers. The set of natural numbers is 
denoted by N. Leaving aside the unit 1, these numbers fall into two cat- 
egories: The indivisible numbers 2,3,5,7,... are the primes, and the rest 
4,6,8,9,10,... composed of primes, are the composite numbers. The following 
basic facts, with proofs, about these numbers were already known to Euclid 
around 300 B.C. 


Theorem 1.1. There are infinitely many primes. 


Theorem 1.2 (Fundamental Theorem of Arithmetic). Every natural number 
n> 1 is a unique product 


n=py...per (r>1) (1.1) 


of powers of distinct primes pi,...,pr, taken in some order. 


By looking at the list of primes, one can ask several naive but still unan- 
swered questions. For example, is there an endless supply of twin primes? We 
call a pair of primes gq, p twin primes if p = q+2. [This is the closest two odd 
primes can be to each other.] A glance at the list 


3,5; 5,7; 11,13; 17,19; 29,31;... 


suggests that there are infinitely many pairs of twin primes, but no one has 
ever been able to prove this so far. Another big problem in number theory is 
the unproven conjecture of Goldbach, which asserts that every even number 
larger than 2 is a sum of two primes. 


Many questions in number theory arise naturally in the study of geom- 
etry. The most fundamental fact in Euclidean geometry is the theorem of 
Pythagoras, which may be called the fundamental theorem of geometry. Ac- 
tually, it was known to the Egyptians and Babylonians about two thousand 
years earlier, but they had no rigorous proof of it like Euclid did. 
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Theorem 1.3 (Fundamental Theorem of Geometry). The real numbers 
O<a<y<z are the side lengths of a right triangle if and only of 


a? + y? = 27, (1.2) 


To number theorists, the most interesting solutions of (1.2), called the 
Pythagorean triples, are those with x, y, z whole numbers, such as (3,4, 5), 
(5, 12,13). It is said that the Egyptians used long ropes divided into three 
parts by two knots of lengths 3, 4 and 5 units. They knew that this way they 
could get error-free right angles for marking the bases for their huge pyramids. 


If (a, y,z) is a Pythagorean triplet, so is (cx, cy, cz) for all integers c > 0. 
But up to similarity, they all represent the same right triangle. The smallest 
among them is called a primitive Pythagorean triplet. One can ask if there are 
infinitely many primitive Pythagorean triplets. The Babylonians were aware 
of at least 15 of them. Since it is exceedingly difficult to find them by trial 
and error, they must have had an algorithm to produce them. Diophantus of 
Alexandria (third century A.D.) for sure had the algorithm to generate them 
all [cf. [41]]. 


By the squares and the cubes we shall mean the squares and cubes of 
whole numbers. Thus the above statement about Pythagorean triples is a 
statement about splitting a square into a sum of two squares. Mathematics of 
the Islamic world during its golden age was built upon the work of the Greek 
(Euclid, Archimedes, Diophantus) and Indian (Aryabhata and Brahmagupta) 
mathematicians. According to Dickson [13], the Islamic mathematician al- 
Khujandi (from Khujand, Tajikistan) was the first to claim that a cube cannot 
be split into two cubes, and gave an erroneous proof. However, it was Fermat 
who claimed that for all integer exponents n > 3, the equation 


xe” +y” = 2" (1.3) 


has no nontrivial solutions in the set Z = {0,+1, ..} of integers. This 
came to be known as Fermat’s Last Theorem or a for short. For n = 4, 
Fermat gave a proof by showing that even the equation 


gt +y* = 2? (1.4) 


has no nontrivial solutions in integers. The proof is based on the Fundamental 
Theorem of Arithmetic (cf. Exercise 3). It is easy to see that it suffices to prove 
FLT when the exponent n = 4 or n is a prime > 3. 


It was realized by Euler, Gauss and others that to prove FLT for ¢ = 3, the 
field Q of rational numbers is not adequate and one must deal with complex 
numbers and wish for the unique factorization in rings like Z[/—3] = {a + 
b\/—3 | a,b € Z}. Unfortunately, this is not true: 4 = 2-2 = (1+ /—3)(1— 
/—3) has two factorizations in Z[,/—3]. 
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To remedy this, one sees that the field K = Q(./—3) = {r+s/—3 | 1,8 € Q} 
is a field of fraction of Z[/—3] but also of a slightly bigger ring B, which is 
a unique factorization domain (UFD), of the so-called Eisenstein integers 
Z\w] = {a + bw | a,b,€ Z}, where w = —$ + V3 j is a primitive cube root of 
unity, i.e. w 4 1 is a root of the polynomial u? — 1. 


The idea of Euler to prove FLT for the exponent ? = 3 was to factor the 
left-hand side of 23 + y? = 2° as a3 +y? = (a+ y)(a@+wy)(x +w7y) and then 
use the divisibility arguments in the ring Z|w] that Fermat used in his proof 
for the exponent n = 4 in Z. 


The following characterization of the ring B above was crucial in the devel- 
opment of algebraic number theory. 


Theorem 1.4. The ring Z|w] is the set of all algebraic integers in the field 
K = Q(vV—3), te. the elements a of K that satisfy monic polynomials over 
Z. 


A polynomial is monic if its leading coefficient is 1. Clearly w is an algebraic 
integer, but it is not in Z[V—3]. 


Remark 1.5. By this definition, the set of algebraic integers in Q is the ring 
Z, as desired. 


One can try to prove FLT for all odd prime exponents by factoring its 
left-hand side as 


af ty! = (a@t+y)(xtCy)---(@+ Co") (1.5) 


in the ring Z[¢] = {a9 + ayO +--+ + ag_1C%? | a; € Z}, where ¢ = ¢y = 
cos aE +7sin oR is a primitive ¢-th root of 1. However, it turned out that the 
unique factorization does not hold in general for all Z[¢¢]. In fact, it holds if 
and only if € < 19. There is a larger class of primes, which Kummer called 
regular primes and proved FLT for. This was quite an achievement because 
only three primes 37, 59 and 67 under 100 are irregular. Although there is 
an algorithm to check whether or not a given prime ¢@ is regular [cf. Exercise 
5], it is not known if there are infinitely many regular primes, whereas the 


infinitude of irregular primes has been known for some time. 


Rather than trying to prove FLT, the school of nineteenth century German 
mathematicians tried to build a theory of rings (called the Dedekind domains) 
to recover the unique factorization for these rings. For this, Dedekind replaced 
the integers in Z by ideals a and the primes p = 2,3,5,... by nonzero prime 
ideals p in these rings. He then showed that every nonzero and non-unit ideal 
has a unique factorization 

a= py... Pp,” 


as powers of distinct prime ideals p,,...,p,, taken in some order. 
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In the ring Z, the integers a > 0 and primes p = 2,3,5,... can be identified 
by the principal ideals a = (a) = aZ = {am | m € Z} and the prime ideals 
p = (p) = pZ. However, the ideal (p) generated by a rational prime p = 
2,3,5,... may not be a prime ideal in the ring O of integers of a number field 
Kk. A number field by definition is a finite extension of the field Q of rational 
numbers. So the following question needed an answer: What is the unique 
factorization 

(p) = py. Pr” 
of (p) into a unique product of power of prime ideals p,,...,p, in the ring O? 
[The Germans called this ring an order and denoted it by O.] At the end of 
the next chapter we will discuss the ring Z/i], i = /—1, of Gaussian integers 
for clues to answer this question. 


A domain (commutative ring with 1 but without zero divisors) A is a prin- 
cipal ideal domain (or PID for short) if every ideal of A is generated by one 
element, i.e. is of the form (a) = {ax | « € A}. A domain is a unique factoriza- 
tion domain (UFD) if the unique factorization of nonzero, non-unit elements 
into a unique product of powers of distinct primes holds. It is well known 
that a PID is a UFD (but not conversely). So the Germans were interested 
in knowing how far the ring Ox of integers of a number field K can be from 
being a PID. They associated to each number field K a positive integer hx, 
called the class number of kK, which measures the deviation of Ox from being 
a PID. In particular, Ox is a PID if and only if hx = 1. They proved that 
hx is always finite. Kummer called an odd prime ¢ regular if it is not a factor 
of the class number of the field Q(¢e) = {co + c1Ge +--+ + ce1Gp | c; in Q} 
and proved FLT for all regular primes. (For proof, see [5, pp. 156 & 378].) 


1.2 Methods of Proving Theorems in Number Theory 


1. Unique Factorization Arguments 


The method that has been used since antiquity is the unique factorization. 
Let us recall Euclid’s proof of Theorem 1.1. 


It follows from the unique factorization (1.1) that any n > 1 is either a 
prime or has a prime factor. To prove Theorem 1.1 by contradiction, suppose 
there are only finitely many primes, say p;,...,p,;. Now consider the number 
n=p,...pr+1. It is not a prime because it is larger than every prime p;. So, 
it has a prime factor, say p,;. Therefore n = p,a for an integer a. This implies 
that 1 = p(a — p2...p,). This is a contradiction because 1 has no prime 
factor. | 
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Another example of such a proof is the proof below by Euler (1770) of the 
following claim of Fermat (1657): 27 is the only cube that exceeds a square by 
2. In modern terminology, (3, +5) are the only points with integer coordinates 
on the elliptic curve 


y =a 2, (1.6) 


Proof. In the ring Z[/—2] = {a + b/—2 | a,b € Z}, which is a UFD (see 
Exercise 8, Chapter 2), we use the factorization 


a = y? +2 = (y+ V—2)(y — V—2). 


In general, in a UFD, if a, 6 have no common factor other than units, and 
ap =y™ for an integer m > 0, then a = aj” and 8 = Pf" for some ay, 8; in 
it. Therefore 


yt+V/—2 = (a+ bV—2)? for a,b € Z. 
By expanding (a + b\/—2)? and comparing the real/imaginary parts, we get 


1 = b(3a? — 26"), y = a? — Gab’. (1.7) 


But the first equation in (1.7) can hold only if b = 1 and a = +1. This implies 
y = bd. 


2. Analytic Methods 


Euler initiated what we call the analytic number theory. The study of in- 
finite series (analysis) can lead to interesting results in number theory. Let 
us recall Euler’s proof of the infinitude of primes. Leaving aside the issue of 
convergence, by multiplying the infinite series formally, one sees that 


aoe -T0+s+S+-), Le. 
Pp 
loc) -1 
7 TC-5) » (18) 
Pp 


the product (called the Euler product) taken over all primes p. Note that the 
first equality is a consequence of the unique factorization (1.1). 


N-1 oo 
The partial sums }> + of the series }> + are bounded from below by the 
n=1 n=1 


area (cf. Figure 1.1) [ dz — \y N, which goes to infinity as N goes to infinity. 
& i & 
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FIGURE 1.1: Divergence of > +. 


Now if there were only finitely many primes, the right-hand side of (1.8) is 
finite whereas its left-hand side is infinite. This is a contradiction. | 


Let ¢(m) be the Euler ¢-function. For a positive integer m, it is the number 
of integers a (1 < a < m) such that a and m have no common factor > 1. 
For such an a, consider the semi-residue class S, = {a+ bm | b © N} modm. 
In 1857 Dirichlet used the Dirichlet L-series to prove that each S, contains 
infinitely many primes. Moreover, each S, contains its expected share oa of 
primes. 


3. Techniques from Algebraic Geometry 


Algebraic geometry is the study of the solutions of polynomial equations 
in a number of variables x1,...,2, with values of x; in a field K. Unless we 
assume K to be algebraically closed, such as the field C of complex numbers, 
the subject is not satisfactory. For example, 2? + y? +1 = 0 has no solution 
with x, y even in such a big field as R, the field of real numbers. Moreover, a 
line (equation of degree 1) is supposed to meet a circle (equation of degree 2) 
in two points. This rarely happens, but happens every time (in the projective 
plane P?(C)), thanks to Bezout’s Theorem: Two curves of degree d,, dz with no 
component in common intersect in dydz points in the projective plane P?(C), 
counted properly. 


The arithmetic algebraic geometry is the subject in which algebraic geo- 
metric methods are used to answer questions in number theory. We illustrate 
it by finding the primitive Pythagorean triples, which is the same as finding 
the rational points (points with rational coordinates) on the unit circle 


AP ey? = 1, (1.9) 


with the rational numbers X, Y in the lowest form. A primitive Pythagorean 
triple (x,y, z) gives such a rational point with X = =, Y = 4, and vice versa. 
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To obtain an algorithm to find all the primitive Pythagorean triples (2, y, z), 
we parameterize the unit circle (1.9) by the slope ¢ of the line through the 
fixed point (—1,0) and a variable point (X,Y) on this circle (cf. Figure 1.2). 


Substituting for X from the equation X = tY — 1 of this line in equation 
(1.9) of the unit circle, an easy calculation shows that 


Qt 1-#? 
Y = —— d X =tYy —1= —-. 
1+ ¢? = : 1+ t? 


If we run ¢ through all rational numbers in the lowest form t = , we get the 
following result: 


FIGURE 1.2: Rational points on the unit circle. 


Theorem 1.6. Every primitive Pythagorean triplet (x,y,z) is of the form 
g=a°—', y=2ab, z=a? +b’, 


where a, b (a > b) are positive integers of opposite parity (one odd, the other 
even) with no common factor. 


Note that the condition of opposite parity is necessary because otherwise 2, 
y, 2 are all even, so (a, y,z) is not primitive. We also remark that switching 
x and y does not produce a different Pythagorean triplet. 


EXERCISES 


1. Verify equation (1.5). 


2. Use Theorem 1.6 to find 15 primitive Pythagorean triplets (the same 
number of them as the Babylonians did). 


3. Proof of FLT for the exponent n= 4 
Example of Fermat’s method of descent: 


It is enough to show that x* + y* = w? has no solution either with 
x,y, w > 0. If it does, pick one with the smallest w. 
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i) Show that x”, y? and w is a primitive Pythagorean triplet, 
hence by Theorem 1.6, 


zg? =a" —b?, y? =2ab, w= 07 + 8 


with a > b > 0 coprime and of opposite parity. 


ii) Show that Theorem 1.6 is applicable again to x? + b? = a? to 


get 
g=s*—t?, b=2st, a=s?+t7 


with s >t > 0 coprime and of opposite parity. 


iii) Show that s, ¢ and a are relatively coprime, hence y? = 4ast 
implies that s, t and a are all squares, say u?, v? and c?. 


Hence, u* + v4 = c?, which contradicts the minimality of w. 
(a) Show that y? = x3 — 5 has no solution in integers. 


(b) Show that the only integer points on the elliptic curve 


y =n —4 


are (5, +11) and (2, +2). 


Hint: Factor 
y? +4 = (24 iy)(2— iy) 


in Z[i] and consider the cases of y odd and even. 
The rational numbers B,,(m > 1), defined by 


e Bm om 
Bs ml!” 


m=1 


are called the Bernoulli numbers. An odd prime @ is a regular 
prime <= £ does not divide the numerators of the Bernoulli num- 
bers By, B4,..., Be—3 (taken in the lowest form). Use this criterion 
to show that 31 is a regular prime, but 37 is not. 
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Review of the Prerequisite Material 


The reader is expected to be familiar with elementary number theory (cf. [40]), 
the language of groups, rings, fields, [23] and vector spaces as found e.g. in [9]. 


However, to avoid later interruptions we briefly review these concepts, 
mostly without proofs. Our main purposes in doing so are to i) make clear 
what is meant by certain mathematical terms, and ii) state the theorems in 
the form needed for their applications. 


2.1 Basic Concepts 


A group is a pair (G,*) of a nonempty set G and a binary operation * on G, 
ie.amap Gx G93 (a,y) > «*y € G, called the group law on G with the 
following properties: 


i) The group law is associative: for all x y, z in G, (a* y) * z= a * (y* 2), 


ii) there is an element e in G, called the identity, such that exa =x*xe =x 
for all x in G and 


iii) for each x in G there is a y in G, such that rx y =y*xu =e. 


We denote y by x~!, the inverse of x. We call the group (G,*) Abelian if 
for all x, yin G, x * y = y* a. In this case * is usually denoted by +, x~! by 
—a, and e by 0. We call —ax the additive inverse of x. Often the product x * y 
is written simply as zy and x~! is called the multiplicative inverse of x. 


It turns out that e and 2! are unique. The most familiar examples of 


Abelian groups are (G,+) with G = Z, Q, R and C. An example of a non- 
Abelian group is the general linear group GL(n,Z) of n x n matrices with 
integer entries and determinant +1 under matrix multiplication. 


A ring is set A with at least two distinct elements, denoted by 0 and 1 
having two binary operations (addition and multiplication) such that 


i) (A,+) is an Abelian group with 0 as its identity, 
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ii) la =a1=~- for all x in A and 


iii) the multiplication is associative and distributive over the addition: 


t(yt+z)=xyt+auz and (w+y)z=a"z+ yz. 


Remark. Some authors don’t require that 0 4 1, but we will. 


The ring A is commutative if cy = yx for all x, y in A. Two important 
examples of commutative rings are (Z,+, x) and the set Q[z] of polynomials 
with coefficients in Q under the usual addition and multiplication of polyno- 
mials. An example of a non-commutative ring is the set of 2 x 2 matrices with 
the addition and multiplication of matrices. Unless stated otherwise, all rings 
considered in this book are assumed to be commutative. Another example of 
such a ring we will meet is the ring Z/mZ = {0,1,2,...,m—1} of the residue 
classes of Z modm (m > 1 an integer). To add and multiply the elements of 
Z/mZ we add and multiply them as usual but mod out the sums and products 
by m, meaning keep only the remainders when divided by m. 


A field is a commutative ring K in which every element 4 0 has a multi- 
plicative inverse. The rings Q, R and C are fields but not Z. If p is a prime 
number, Z/pZ is a finite field of p elements and is denoted by F,. 


A module over a (always commutative) ring A, or simply an A-module is 
an Abelian group (M,+) together with a map A x M 35 (a,x) > ax € M, 
called the scaling of x by a, such that for all x, y in M and a, 6 in A, 


i) a(a+y)=ar+ay, 


ii) (a+b)a = ax + ba, 


ab)x = a(bx) and 


Hl 


( 
( 


lV 


) 
) 
) 
) toa 


The rings M = Z[i| and Z[w] of respectively Gaussian and Eisenstein in- 
tegers are modules over the ring A = Z. In general, if for a prime p 
¢ = cos an + isin ou is a primitive p-th root of 1, then Z[¢] = {ao + aig + 


+++ + Gy—1¢P | a; in Z} is a Z-module. 


To begin with, a field is a ring. A module over a field K is called a vector 
space over K. 


A nonempty subset Y of an object X (group, ring, field, module, vector 
space, etc.) is a subobject (subgroup, subring, subfield, submodule, subspace, 
etc.) if Y is closed under the operations for X. For example, if M is a module 
over a ring A, a non-empty set N of M is a submodule of M if for all x, y in 
N and all a in A, x + y and az are in N. A subobject Y of X is proper if 
Y #£ X. If Aisa subring of a ring B, then clearly B is an A-module. Similarly 
if k is a subfield of a field K, then K is a vector space over k. 
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Let X, Y be two objects in the same category, i.e. X, Y are either both 
groups, or both rings, etc. A homomorphism (or sometimes simply a mor- 
phism) is a map f : X — Y which preserves the operation(s) for the objects 
in the category. For example, if M and N are both modules over the same 
ring A, then the map f : M > N is a homomorphism of modules (linear 
transformation or linear map if A is a field) if for all x, y in M and all a in 
A, f(a@+y) = f(x) + f(y) and f(ax) = af(x). Two objects X, Y in the same 
category are isomorphic if there is a bijective homomorphism f : X + Y. We 
write it as X ~ Y. A bijective morphism f : X — X is called an automor- 
phism of X. 


A ring A may be regarded as a module over itself. In this case, a submodule 
a of A is called an ideal of the ring A. (This is precisely how the term ideal 
was coined by Dedekind in 1870, cf. [11, p. 96].) Every ideal of A contains its 
zero element. 


Often it is possible to divide an object X evenly by its subobjects to get 
the quotient X/Y. We illustrate it with a submodule N of a module M over 
a ring A. The equivalence relation ~ on M defined by x ~ y if and only if 
z—y isin N partitions M into disjoint sets, called the cosets of M by N, of 
the form «+ N = {a@+y]|y in N}. The following addition and scaling on the 
set M/N of these cosets turn M/N into a module over A, called the quotient 
of M by N: 


(c+N)+(y+N)=(a@+y)+QN, 
a(a+N)=ar4+N. 


As an example and another way to define the ring Z/mZ, take A = Z, and 
N = mdZ. Note that N is also an ideal of Z, so we can quotient a ring by its 
ideals. 


Let V be a vector space over a field k. We say that a subset S of V spans 
V if V = {c1u1 +--+ + Cnvn | cj ink, v; in S}, and V is finite-dimensional 
if there is a finite subset of V which spans V. The dimension dim,(V) of a 
finite-dimensional vector space V is the least number of vectors in V that span 
V. A set B with a minimal number of vectors that spans V is called a basis 
of V over k. 


A field K is called a field extension of a field k if k is a subfield of K. If kK 
is a field extension of k, we write it as K/k. If K/k is a field extension, clearly 
K is a vector space over k. If dim;,(K) = 1 is finite, we call K/k a finite field 
extension. We call n the degree of the field extension K/k and denote it by 
[Kk : k]. (The reason for this terminology will become obvious shortly.) 


Let M be a Z-module such that there is a finite subset {x,...,a,} of M 
with M = {a;21 +---+a,z, | a; in Z}. Such a subset with the smallest r is 
called a Z-basis of M. The example that will concern us is the ring Z[¢], ¢ a 
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primitive m-th root of unity, considered as a Z-module. One of our goals will 
be to find an explicit Z-basis for Z[d]. 


2.2 Galois Extensions 


We assume all our fields to be subfields of C. Let K/k be a field extension. 
The set Gal(K/k) of the field automorphisms o of K such that o(a) = a for 
all a in k is a (usually non-Abelian) group under the composition of maps. It 
is called the Galois group of K over k. In general, for a finite extension K/k, 
| Gal(K/k)| < [K : k]. We call K/k a Galois extension if the equality holds. 


Examples of Galois Groups 
First, let K/k be any field extension, not necessarily finite. Let a in K be 
a root of a polynomial 
f(x) =cot aa +--+ + ena” 
over k. If o € Gal(K/k), then 


f(a(@)) = co + c10(a) +--+ + en(o(a))” 
= 0(f(a)) = 0(0) =0. 


Thus o(q) is also a root of f(a). This simple observation will be crucial to 
what follows. 


Let K be a quadratic field, a field extension of Q of degree 2. Then one 
checks that (Exercise 16) K = Q(Vd) = {r+sVd|r,s € Q} for a square-free 
integer d £ 0, 1. 


Example 2.1. Let us take d = —1. There are exactly two automorphisms of 
K whose restrictions to Q is the identity map on Q. The identity map 1 on kK 
itself and o which takes 7 to its conjugate, the other root —i of x? + 1. Thus 
Gal(K/k) = {+1} and Q(¢) is a Galois extension of Q. 


Example 2.2. Now take d = —3. Then Q(w) = {r+sw | r,s € Q}. The Galois 
group Gal(K/k) consists of two elements, the identity automorphism 1 of K 
and the automorphism o of K such that o(w) =. [Note that @ = w? = 4)] 
Hence Q(w)/Q is also an Abelian extension. 


Example 2.3. Let a be the real cube root of 2, a = W2, K = Q(a) the 
smallest subfield of C containing a. The other cube roots of 2 which are 
wa and w’aq@ are not in K. Thus there is only one element in the Galois 
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group Gal(K/Q), namely the identity element of the group Gal(/Q). Since 
[K : Q| =3 but | Gal(K/Q)| = 1, the extension K/Q is not Galois. 


The following is a standard result in field theory: 
Theorem 2.4. If K/k is a field extension of degree d, then there is an a in 
K such that 1,a,a?,...,a¢~1 is a basis of K as a vector space over k. 

In fact, a is a root of an irreducible polynomial f(a) over k of degree d. 


Definition 2.5. If all the d roots of this f(a) are in K, we call the extension 
K/k normal. 


Remark 2.6. i) According to our definition of the Galois extension, an ex- 
tension is normal if and only if it is Galois. 


ii) The Galois group Gal(/k) is often defined only for normal extensions, 
in which case Gal(A/k) is always equal to the degree [Kk : k] of the field 
extension K/k. 


2.3 Integral Domains 


A nonzero element a of a ring A (always commutative) is called a zero divisor 
if ab = 0 for a nonzero b in A. In the ring Z/6Z, 2, 3, and 4 are the only 
divisors of zero. A field has no divisor of zero. A ring without zero divisors 
is called an integral domain or simply a domain. We have already discussed 
many integral domains which are not fields, e.g. Z, Zi], Z[w] and Z[Vd] for 
d #0, a square-free integer, which are relevant to our subject. 


An element u in A is a unit if uv = 1 for some v in B. For example, the 
only units in the ring Z are +1. 


Definition 2.7. A domain A is a Fuclidean domain if there is a map which 
assigns to each nonzero element a of A a non-negative integer d(@) such that 
for all nonzero a, 2 in A, 


i) d(a) < d(a@), and 


ii) A has elements gq (the quotient) and y (the remainder) so that a = q8+7 
and either y = 0 or d(y) < d({). 


With the Euclidean algorithm, both Z and the ring k[x] of polynomials over 
a field k are Euclidean domains. For Z, d(a) = |a| and for k[z], d(f(x)) = 


deg f(x). 
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Example 2.8. For an a = a+ bi in the field QJi], the conjugate of a is the 
element @ = a — bi of Q|i]. The norm of a is the rational number N(a) = 
ad = a? + b? which is non-negative and = 0 if and only if a = 0. Moreover, 
N(a8) = N(a)N(8). We show that the ring Z[i] is norm Euclidean, i.e. 
d(a) = N(qa) makes Z|?] a Euclidean domain. 

The condition i) in the definition is obvious. For ii) let a = a+ib, 6 = c+id 
be in Z[i]. Then 


a ac+bd be-ad. 


B 218 ie 
=A+iB, say. 


Note that A and B are in Q, and not necessarily in Z. 


Choose integers x and y so that |A — 2| < $ and |B —y| < 4. If we put 


q= x+y, a quick calculation shows that N (3 - 4) < 1. We take y to be 
a — gB. If-y #0, then N(y) = N (8 ($-a)) < N(B). | 


For a domain A and nonzero a, b in A, b is a divisor of a if a = bc for some 
cin A. We also say that b is a factor of a or a is a multiple of b. A nonzero 
and non-unit element 7 of A is a prime element of A if the only divisors of 7 
are ut with wu a unit in A. 


Definition 2.9. A domain A is a unique factorization domain (UFD for short) 
if each nonzero element of A which is not a unit is, up to a unit factor, a unique 
product of powers of distinct primes. A domain is a principal ideal domain (or 
PID) if each of its ideal a is principal, i.e. is of the forma = aA = {ax | x € A}. 


Theorem 2.10. Every Euclidean domain is a PID. 


Theorem 2.11. Every PID is a UFD. 


Thus every Euclidean domain is a UFD. In particular, the ring k[z] is a 
UFD. The following is a well-known result, but it will not concern us. 


Theorem 2.12. For d > 1, the ring Z[Vd] is norm Euclidean & d = 
2,3, 5,6, 7, 11, 13, 17, 19, 21, 29, 33,37, 41,55 and 73. 


Let M be a module over a commutative ring A with 1. We say that M is a 
finitely generated A-module if there is a finite set {x1,...,x,} of elements of 
M such that each x in M is a linear combination 


U= CX, +++ + C-x, (2.1) 
for some c; in A. A finitely generated module M over A is free of rank r if 


M = Ax, @:::@ Az,. (2.2) 
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This means that the representation (2.1) is unique for each x in M. 


The following fact from an undergraduate course in abstract algebra will be 
recalled when needed. For a proof, see [23]. 


Theorem 2.13. A finitely generated module over a PID is free. 


2.4 Factoring Rational Primes in Z/i| 


Let A be the ring Z|] of Gaussian integers and p = 2,3,4,...arational prime. 
This p may or may not be a prime element of A. To find exactly when it is, 
recall the famous theorem of Fermat on the sum of two squares, which was 
proved by Euler (cf. [8, p. 48]). 


Theorem 2.14 (Fermat). An odd prime p in Z is a sum of two squares 
(p = a? +b?) if and only if p=4k +1 fork inN. 


The norm of any divisor of a = a+ ib must be a divisor of N(a) = a? + b?, 
and for a = 6y with 8, 7 both non-units, 1 < N(8) < N(q@) (only the units 
have norm 1). Therefore, if a? + b? is a prime, then a has to be a prime in 
Z|i]. We have thus proved the following fact: 


Theorem 2.15. A prime p is a sum of two squares, p = a? +b? © p is a 
product (a + ib)(a — ib) of two primes at ib in Zii]. 


For p = 2, its two prime factors 1+7, 1—i in Z[#] are associates: 1+i = i(1—i). 

Therefore, 
2=i(1—i)*. 
We say that 2 ramifies in Z[i]. By Fermat’s Theorem (Theorem 2.15), p = 1 
(mod 4) © p is a product 
p= 72 

of two primes 71, 72 in Z[i]. Moreover, 7, and 72 are complex conjugates of 
each other and hence they are distinct. This discussion can be wrapped up as 
follows: In order to do that, observe that {1,i} is a Z-bases of Z[i] and so is 
its conjugate {1,—i}. These two bases make a 2 x 2 matrix 


1 i 
a & > 
with | det(A)| = 2, called the discriminant of Q(i). 


Theorem 2.16. Let p be a prime. Then 


i) p ramifies in Z[i] = it divides the discriminant of Q(t), 
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ii) p factors into two distinct primes of Z[i] @ p= 1 (mod 4), and 
iii) p stays prime in Zii] = p = 3 (mod 4). 


EXERCISES 


1. Asa prelude to Dirichlet’s Theorem on Units, let d > 1 be a square- 
free integer. Let Gg be the set of integer solutions (x,y) of the Pell 
equation 

x? — dy? = 1. (2.3) 


Define a binary operation * on Gg by 


(21, Y1) * (%2, y2) = (1% + dyry2, L1y2 + 241). 


Show that (Ga,*) is an Abelian group with identity (1,0) and 
(w,y)~* = (@, —-y). 


2. With d as in 1, let G be the group of 2 x 2 matrices é ) of 


determinant 1 with a, y integers, and G’ = (Z[Vd])*, the group of 
units in the ring Z[Vd] of positive norm. 


Show that G, G’, and Gq are all isomorphic to each other. In other 
words, the same group has been obtained in three different ways. 


3. It will follow from the Dirichlet Theorem that Gg = {+1} x Z. The 
elements of Gg with x > 0 are points on the parabola x? — dy? = 1 
(cf. Figure 2.1) in the right half of the plane. Those with x < 0 
come from —1 in the factor group {+1} of Ga. 


ge? — dy? =1 


FIGURE 2.1: Pell equation as hyperbola. 
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10. 


11. 
12. 


13. 


14. 


There are four generators of the free part of Gg. To find one, put 
y = 1,2,3,... until it becomes a square x7. (Dirichlet’s Theorem 
guarantees that sooner or later it will.) Find (a1, y1) with 71, y, > 0 
for Gq if d = 2,3, 15, 67, 77, 94. 


i) Use the group law in Gg to compute (x5, y5) = (21, y1)° for 
d= 2. 


ii) Rewriting the Pell equation as 


2 
1 
y y 


shows that : approximates Vd if y is very large. Use part i) to 
find a rational approximation to V2. 


iii) Compare this approximation with the one given by a computer. 
Show that Z[Vd] and Z[w] are rings. 


In Definition 2.7 of a Euclidean domain, show that the equality 
d(a) = d(a{) holds if and only if 6 is a unit. 


Show that the only units of Z[w] are +1, tw, +w?. 


Show that the ring Z[,/—2] is norm Euclidean. 


Show that Z[w] is norm Euclidean. 
Show that a Euclidean domain A is a PID. 


Hint: If a is a nonzero ideal of A, choose an element a # 0 in A 
with the smallest d(a). Show that a = (a). 


Give an example of a UFD which is not a PID. 


Let m be an ideal of a ring A. We call m mazimal if m # A and 
there is no ideal a with m GS a S A. We call an ideal p of a ring A 
prime if ab € p implies either a € p or bE p. 


Show that a maximal ideal is a prime ideal but not conversely. 


Show that m is a maximal ideal of A = A/m is a field and p a prime 
ideal of A = A/p is a domain. 


If f : A— Bis a ring homomorphism, show that 
i) its kernel Ker(f) = {x € A| f(x) = 0} is an ideal of A, 
ii) f is injective = Ker(f) = {0} and 


iii) if A is a field, then f is either the zero map or injective. 


18 


15. 


16. 
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First Isomorphism Theorem 


Let X, Y be both Abelian groups, both commutative rings with 1, 
or both modules over a commutative ring with 1 (vector spaces if 
this ring is a field). Let f : X — Y be a morphism of these two 
objects (in the same category). Let Ker(f) = {a € X | f(a) = 0} 
be the kernel of f whereas f(X) = {f(x) | « € X} its image. Show 
that X/Ker(f) = f(X). 


Show that any quadratic extension of Q is Q(Vd) for a square free 
integer d £ 0, 1. 


If @ is an odd prime and ¢ = (yp = cos 34 + isin 24, show that 


Q(¢) = {ao + a1€ + aad? +--+ + ap_i¢*! | a; € Q} is a field 
extension of Q of degree p — 1. 


Show that Gal(Q(¢)/Q) is isomorphic to the group of units of the 
ring Z/pZ. Thus Q(¢)/Q is a Galois extension. 


Hint: If o € Gal(Q(¢)/Q), then o(¢) is another root of unity. 


Let m be any positive integer, not necessarily a prime, and ¢ = 


2 ace 27 
cos 7 +2sin 7. 


Show that Q(¢)/Q is still Galois. What is its Galois group? 
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3.1 Generalities 


Let K/k be a field extension and suppose a is an element of K. We say 
that a is algebraic over k if a satisfies a nonzero polynomial over k. Suppose 
n = dim,(K) is finite and a isin K. Then the n+1 vectors 1,a,...,a@” cannot 
be linearly independent and hence satisfy a nontrivial linear relation 


cotcat:::+cna” =0, 


with c; in k. This not only shows that a is algebraic over k but also proves 
that it is a root of a nonzero polynomial of degree at most n over k. The 
smallest degree of a polynomial over k satisfied by a is called the degree of a 
over k. It is denoted by deg;,(q). 


3.2 Algebraic Integers 


The subject of algebraic number theory originated with Gauss, who studied 
the arithmetic in the ring Z[i] = {x + iy|x,y € Z} of the so called Gaussian 
integers. We begin with a useful fact about field extensions which is true for 
the ones to be dealt with in this book. 


Definition 3.1. A field extension K/k is a simple extension if there is an 
element a in K such that K = k(a). 


Here k(a) is the field of all quotients of polynomials in a over k. It is 
the smallest field containing /& and a. We say that K has been obtained by 
adjoining a to k. We also say that a generates Kk over k. 


From now on, we shall regard C, the field of complex numbers, as our uni- 
versal domain. This essentially means that all fields, unless stated otherwise, 
shall be subfields of C. A field must have at least two distinct elements, namely, 
0 and 1. Therefore, a subfield of C must contain Z, and hence it must be an 
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extension of Q. The following is a standard result from field theory (cf. [8, p. 
72)). 


Theorem 3.2. If k is a subfield of C, then any finite extension K/k is a 
simple extension. 


Definition 3.3. A number field is a finite extension of Q. A number field K 
is a quadratic field or a cubic field according as [K : Q] is 2 or 3. We call a field 
extension K/k an extension of number fields if k is a subfield of the number 
field K. Clearly, k is also a number field. 


Definition 3.4. A complex number a is an algebraic number if it is algebraic 
over Q. 


It is not hard to see that the set of all algebraic numbers is a subfield of C. 
It is called the algebraic closure of Q in C and is denoted by Q. 


Every element of a number field K with [K : Q] = n is an algebraic number 
of degree at most n. By Theorem 3.2, there is always an a in K with degg(a) = 
n. 


The following definition is crucial to what follows. 


Definition 3.5. A polynomial is monic if its leading coefficient is 1. 


Note that since an algebraic number satisfies a polynomial f(x) over Q, by 
clearing the denominators from the coefficients of f(x) if necessary, we may 
assume that the coefficients of f(z) are actually in Z. But then f(a) may 
not be monic. In other words, an algebraic number may not be a root of a 
monic polynomial over Z. The algebraic numbers which are roots of monic 
polynomials over Z play a central role in algebraic number theory and have a 
special name. 


Definition 3.6. An algebraic number is an algebraic integer if it is a root of 
a monic polynomial over Z. 


Theorem 3.7. The set of algebraic integers is a subring of C containing Z 
as a subring. 


We postpone the proof of this theorem, and denote the ring of all algebraic 
integers by O. It follows that for a field K the set ON K is a subring of Kk. It 
is denoted by Ox. For a number field K, the ring Ox has the same relation 
to K as Z has to Q. 


Definition 3.8. The ring Ox is called the ring of integers of K. 
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EXERCISE 
Prove that the ring of integers of Q is Z. 


[Hint: To prove this plug a = a/b in Q (taken in the lowest form) into the 
polynomial equation defining an algebraic integer. Show that b must divide 1.] 


Theorem 3.7 follows at once from a standard proposition in commutative 
algebra, which we shall prove after some preliminary remarks. Recall that for 
a commutative ring A with 1, the definition of a module over A, or simply 
an A-module, is the same as that of a vector space except that the scalars 
are taken from the ring A, which may not be a field. A module M over A is 
finitely generated or simply a finite A-module if M contains a finite set S which 
generates M/. This means that M consists of linear combinations of elements 
of S with coefficients in A. A commutative ring A with 1 is an integral domain, 
or for short a domain, if it has no nonzero divisors of zero, i.e. if x,y € A and 
xy = 0 implies that either x = 0 or y = 0. All subrings are assumed to contain 
1. 


Definition 3.9. Let A,B be two domains. Suppose A is a subring of B. An 
element of B is integral over A if it is a root of a monic polynomial over A. 


Proposition 3.10. Let a domain A be a subring of a domain B anda e€ B. 
The following are equivalent: 


(1) @ is integral over A. 
(2) The ring Ala] of polynomials in a over A is a finite A-module. 
(3) There is a finite A-module M 4 {0} such thataM © M. 
Proof. (1) implies (2): If 
a” +c,-1a" 1 +-+--+¢9 =0 (3.1) 
with c; in A, we put 
M=A+Aa+---+ Aa”!, 


Since by equation (3.1), n-th and higher powers of a can be expressed as linear 
combinations of the lower powers of a with coefficients in A, it is clear that 
Ala] = M, and the A-module M is generated by the finitely many elements 
lra,...,a"7t. 


(2) implies (3): The finitely generated A-module M = Ala] 4 {0} and 
aM CM. 
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(3) implies (1): Let M = Aa, +---+ Aan # {0}. Then aa; = yet AjjQy, 
or 


Q1 Q1 a1 
a}: |=PF]: ], ie (al-T)| : | =0, 
An An An 


where T is the n by n matrix (a,;) with entries in A and I the n by n identity 


matrix. Since the vector 
ay 


# 0, 


An 


the determinant det(aJ — T) = 0, which shows that a satisfies a monic poly- 
nomial over A. 


Proof of Theorem 3.7. Let a and 6 be algebraic integers. Then Z[a] and 
Z|@| are finitely generated Z-modules, say generated by the finite sets {a;} 
and {§;}, respectively. Moreover, M = Z[a, {| is not zero and is generated as 
a Z-module by {a;6;}. Now if y=a+t6 or a, it is clear that yM & M. By 
Proposition 3.10, y is an algebraic integer. This shows that O is closed under 
ring operations and hence it is a subring of C. Since 0 and 1 are algebraic 
integers, ZC O. 


Theorem 3.11. If @ is an algebraic number, then there is an algebraic integer 
a and a rational integer m (that is an m in Z) such that B = a/m. 


Proof. Suppose 
(nb +++- +6 +c =0 (3.2) 


with c; in Z and c, # 0. Let m =c,,. Multiplying the equation (3.2) through- 
out by m”—!, we obtain 


(mB)” a eae m”~*c, (mB) + m” +e = 0. 


This shows that a = m@ is a root of a monic polynomial over Z and hence is 
an algebraic integer a, which proves the theorem. 


Corollary 3.12. Any given number field K is the field of fractions of its ring 
of integers Ox. 
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3.3 Integral Bases 


Let A be a commutative ring with 1. Suppose M 4 {0} is an A-module. We 
say that M is a free A-module of rank n (n being an integer > 1) if there are 
n elements Q1,...,Q@, in M such that every element a of M can be uniquely 
written as 

A= ajay t::++anQn 


with a; in A. We write it as 
M = Aa, ®...® Aan. 


The set {a1,...,@,} is called a basis of M over A. If the elements of a basis 
are taken in a fixed order, it is called an ordered basis. In this section, we shall 
prove that for a number field K, its ring of integers Ox is a free Z-module of 
rank [K : k]. We recall some basic facts needed from linear algebra and Galois 
theory. 


Suppose x = (2;,;) is in M(n, A), that is x is an n by n matrix with entries 
in A. 


Definition 3.13. The trace tr(x) of x is the sum 44, +---+2ny of the diagonal 
entries of x. 


The following theorem is obvious. 


Theorem 3.14. Let x,y be in M(n,A) and a in A. Then 


(1) tr(a + y) = tr(x) + tr(y). 
(2) tr(ax) =a tr(x) 


(3) tr(ay) = tr(ye). 


Now suppose M a free A-module of rank n over A. Let 1: M > M bea 
homomorphism of A-modules, or simply an A-homomorphism. We associate 
a matrix L over A to A with respect to an ordered basis {a1,...,Q@,} of 
M over A in the same way as to a linear transformation. If L; and Lz are 
the matrices of \ with respect to two ordered bases, then Lz = P~!L,P for 
some P in GL(n, A), that is, for a matrix P over A whose determinant has 
multiplicative inverse in A. 


For the rest of the section, let K/k be an extension of number fields. Since 
the dimension dim; (A) cannot be more than dimg(K), it is clear that K/k 
is a finite extension. We may regard K as a k-module of rank n = [K : k]. For 
a in K, the multiplication by a is a k-homomorphism my: K > K. Let Lo 
be the matrix of ma, with respect to an ordered basis of K over k. 
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Definition 3.15. The relative norm and the relative trace for K/k of a are 

Nx/x(a) = det (La) 

trK/n(a) = tr(La). 

We know from linear algebra that det(P~'LP) = det(L) and tr(P~!LP) = 
tr(L). Therefore, these terms do not depend on the ordered basis chosen to 
define Lg and hence, are well defined. If there is no likelihood of confusion, 
we omit the adjective “relative” , 


Remark 3.16. The norm and the trace are maps from K to k. 


The following is obvious. 
Theorem 3.17. Fora, in K anda, b ink, 


(1) Nx/z (a8) = Nxjx(a)Nx/x(8) 
(2) trK/p(aa + bB) =a trK/Kn(@) + b trx/K (8). 


In particular, 


(1) Nepp(a) = 0". 
(2) trK/R(a) = na. 


By Theorem 3.2, any extension K’/k of number fields is a simple extension. 
Let K = k(a). If f(x) is the minimal polynomial of a over k, that is, the 
monic polynomial of the smallest degree over k of which @ is a root, then 
clearly f(a) is irreducible over k and deg;,(a) < [K : k]. Let deg f(x) =n. 


EXERCISES 


1. Show that f(x) has n distinct roots. 


[Hint: If a is a repeated root of f(x), then f(x) = (x — a)?g(zx) 
for some g(x). Differentiate this equation and put x = a in the 
resulting equation to get a contradiction.] 


2. Show that the field k(a) = the ring k[a] of polynomials in a over 
Ki 


[Hint: If g(a) is not zero, then f(x) and g(x) are coprime (why’). 
Show that (by the Euclidean algorithm) one can explicitly write 
1 = a(x)g(x) + b(x) f(a) for some a(x) and g(a) (how?). Put in this 
equation « = a to show that g(a) has a multiplicative inverse in 
ka]. 
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Thus [Kk : k] < deg f(a) < [K : k], which shows that [Kk : k] = deg f(x) =n. 
Hence, we obtain the following result. 


Theorem 3.18. For any extension K/k of number fields, there is an a in K, 
such that K = kla] and 
[Kk : k] = deg, (a). 


Remark 3.19. This explains why the dimension dim; (4), which we denoted 
by [K : k] is called the degree of K over k. It is the degree deg;,(a@) of the 
minimal polynomial of a that generates K over k. 


Remark 3.20. Suppose o : K — C is a ring homomorphism. Since by def- 
inition, g(1) = 1, the restriction ojg = the identity map on Q. Thus a ring 
homomorphism from K to C is a Q-homomorphism of Q-modules from K to 
C. In general, suppose K/k is an extension of number fields. We call a ring 
homomorphism ¢ : K — C a k-homomorphism if the restriction oj, is the 
identity map. 


Theorem 3.21. There are exactly |K : k] k-homomorphisms ao: K > C. 


Proof. Such a o fixes k, hence o(f(a@)) = f(o(a)) = 0, which shows that 
a(q) is also a root of f(x). Therefore, in view of K = k(a), o is determined by 
the value o(a), which can be any of the n distinct roots of f(x). This proves 
the theorem. 


Let 01,...,0n be these n = [K : k] k-homomorphisms. By Galois Theory, 
k={a€K|o;(a) =a, j=1,...,n}. 


Theorem 3.22. Ifa is an algebraic integer, then for every i, o;(a) is also 
an algebraic integer. 


Proof. Let f(x) be a monic polynomial over Z satisfied by a. Since o; fixes 
Q, we have f(a;(a)) = oi(f(a)) = 0. So o;(a) also satisfies the same monic 
polynomial over Z. 


To prove that Ox has a Z-basis, we begin with 


Theorem 3.23. The set {a1,...,Qn} °C K is a basis of K over k if and only 
if the matriz P = (a;(a;)) is non-singular. 


Proof. Let us prove the theorem first for a set of the form {1,a,...,a"~1} 
with a in K. Since K/k is a simple extension, such a basis exists. In this case 
det(P) is the well-known van der Monde determinant, which can be simplified 


to 
[[(eil) = 2;(a)). 


i<j 
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Therefore, det(P) is nonzero if and only if o1(a),...,o,(a) are all distinct. 
This happens if and only if the minimal polynomial of a over & has n (distinct) 
roots = deg, (a) =n > {1,a,...,a"~1} is a basis of K over k. 


For any two bases of K over k, the corresponding P are conjugate by a non- 
singular matrix. Therefore, if the theorem is true for one set {a1,...,Qn}, it 
is true for all of them. 


Theorem 3.24. For a in K, 


trK/n(@) = Se o;(a), 


Nx x(a) = [[ o;(0). 


Proof. Let {a1,...,Q@n} be a basis of K over k and write 


n 
aaj = ) ar j Ay 
r=1 


with a,; in Q. Apply the k-homomorphism o; to this equation to get 


a)o;(a;) => Tz Oly. \Orys 


which can be written as a single matrix equation DP = PLag, or 
D=PL,P", 


where P = (0;(a;)) and D is the diagonal matrix diag(o1(@),...,0n(a)). The 
theorem follows by taking the norm and the trace of the matrix D above. 


Now the following is almost obvious. 


Theorem 3.25. For a in Ox, the norm and the trace of a over K/Q are in 
Z. 


Proof. By Remark 3.16, the norm and the trace are in Q, whereas by Theo- 
rem 3.22 and Theorem 3.24, they are in Ox. Hence they are in QN Ox =Z 
(See Exercise 3.2). 


Now we prove the important theorem we mentioned earlier. 


Theorem 3.26. Suppose K is a number field with [K : Q|) =n. There are 
algebraic integers a1,...,Qn, such that 


OK = Za, ®-::@ Lay. 
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Proof. By Theorem 3.11, we can choose a basis {a1,...,@n} of K over Q 
consisting of algebraic integers. Let P be the n by n matrix (0;(a;)). Then, 
being fixed by every o;, 


A(a1,...,@n) = det(P?) (3.3) 


is a nonzero integer. Choose such a basis with minimum |A(qy,...,@,)|. Now 
suppose a € Ox and 
A= 4101, ++++ + GnQn 


with a; € Q. If an aj, say a; is not in Z, we can write 


aj=atr (0<r<l1). 


Put 
Wy = A-— aay = TA, + 4202 +++: + Anan 
and 
Wy = AZ 
for 7 > 2. 
It is clear that {w),...,w,} is also a basis consisting of algebraic integers. 


The transition matrix for these two bases is 


which gives A(w1,...,Wn) = r?A(a1,...,Qn), contradicting the minimality 
of |A(ay,...,Qn)|. 


Definition 3.27. The set {a,,...,a@,} in Theorem 3.26 is called an integral 
basis of the ring Ox (of integers of kK’), or for brevity, an integral basis of K. 


Remark 3.28. Any two integral bases are connected by an n by n matrix P 
in GL(n, Z), i.e. an invertible matrix P over Z, such that P~+ also has entries 
in Z. 


Definition 3.29. Let {a1,...,a,} be an integral basis of K and dx = A(m, 
..+;Q@m) the well-defined nonzero integer given by equation (3.3). We call dx 
the discriminant of Kk. 
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3.4 Quadratic Fields 


A number field K is a quadratic field if the degree [K : Q] = 2. By Theorem 
3.18, K = Q(a), where a is a root of an irreducible polynomial f(x) = ax? + 
ba + c of degree 2 over Q. Since a@ is not a rational number, the discriminant 
D = b? — 4ac of f(x) cannot be zero or a perfect square. Write D = dm?, 
with the integer d # 0,1, having no square factor larger than 1. From the 
quadratic formula for solving quadratic polynomial equations, it is clear that 
Q(a) = Q(./d). We summarize this as 


Proposition 3.30. A quadratic field K = Q(Vd) for a square-free integer 
d#0,1. 

The following theorem exhibits an integral basis of the ring of integers of a 
quadratic field explicitly. 


Theorem 3.31. Suppose d 4 0,1 is a square-free integer. Put 


Vd ifd=2,3 (mod 4) 
i 
1iVd ifd@=1 (mod 4) 


Then {1,w} is an integral basis of K = Q(Vd). 


Proof. First we show that Ox D Z+Zw. For this, all we need to show is that 
in case of d= 1 (mod 4), w = (1+ Vd)/2 is a root of a monic polynomial of 
degree 2 over Z. It is easy to see that x? —trx/g(w)a+Nx/,(w) € Z[a] is such 
a polynomial. Next we show that O« C Z+ Zw. Suppose a = a+ bVd € Ox 
with a,b € Q. We know that n = Nx /, (a) = a*—db?,m = trK/p(@) = 2a € Z. 
Now if m is even, then a € Z => db? € Z. Since d is square-free, this implies 
that b € Z. This shows that a € Z + Zw. If m is odd, then db? — + € Z. 
Since d is square-free, b = c/2 with c odd. This gives w = 1ivd andd=1 
(mod 4). 


Corollary 3.32. The discriminant of the quadratic field K = Q(Vd), where 
d#0,1 is square-free, is given by 


gine 4d if d=2,3 (mod 4) 
Ke )d ifd=1 (mod 4). 


Proof. The two Q-homomorphisms o; : K — C are the identity 0, = lx 
and the conjugation o2 defined by o2(x + yd) = x — yv/d. Let {a1, a2} be 
the integral basis of K given by Theorem 3.31. Using dx = (det(a;(a;)))?, a 
short calculation is all we need. 
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Proposition 3.30 above can now be made more precise. 


Theorem 3.33. If K is a quadratic field, then K = Q(Vdx), where dx is 
the discriminant of Kk. 


3.5 Unique Factorization Property for Ideals 


Let A be a commutative ring with 1. We recall the definition of an ideal of 
A. Suppose a is a nonempty subset of A. We say that a is an ideal of A, if for 
every a € A and z,y € a, av and x+y € a. Every ideal contains 0, the zero 
element of A. The whole ring A itself is an ideal. An ideal a of A is a proper 
ideal if A 2 a. If a 2 {0}, then we call a a nonzero ideal. 


Theorem 3.34. If 2 is a nonzero ideal of Ox, thena=2AMZ is a nonzero 
ideal of Z. 


Proof. If 0 4 a € 2, then a satisfies a nonzero monic polynomial over Z, 
Le. 
ap taya+::-+a"=0, 


with a; in Z and ao ¥ 0. Using the defining properties of an ideal, we see that 
---— ana" EANZ=a. 


ag = —a,a 


Let a be an ideal. The relation « ~ y © x—y € a is an equivalence relation 
which partitions A into disjoint sets of the form x +a= {x+al]a € a}, called 
the cosets of a in A. This set of cosets is a ring, called the quotient of A by a 
and is denoted by A/a. The ring operations on A/a are defined in an obvious 
way, namely, 


(x+a)t+(yta)=(rt+y) +a, (x+a)(y+a)=axzy +a. 


Remark 3.35. Let a be an ideal of A. The notation « = y (mod a) means 
that «—y €a. 


Definition 3.36. Suppose m is a proper ideal of A. We call m a maximal 
ideal if for no other proper ideal a, we can have m g a. We call a proper ideal 
p a prime ideal, if a, b € A, ab € p implies that either a € p or DE p. 
Theorem 3.37. Suppose a is an ideal of A. Then 


1. a is maximal if and only if A/a is a field. 


2. a is prime if and only if A/a is an integral domain. 
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Thus every maximal ideal is a prime ideal. 


Arithmetic operations on ideals 


Suppose a, 6 are two ideals of a ring A. We define their sum and product as 
follows. 


1. a+b={a+b|a€a,be b}, 


2. ab is the set of all finite sums a,b) +---+anbn with a; € a,b; € b. 
It is the smallest ideal containing all the elements of the form ab 
withacea,beb. 


Definition 3.38. Let a and 6 be two ideals of A with a 4 (0). We say that 
a divides 6 if a 2 b. We write it as alb. 


Remark 3.39. An ideal a of a commutative ring A with 1 is a principal ideal 
if it is generated by one element, i.e. a = (a) = aA = {ax|x € A}. Note that 
A = (1). We call A a principal ideal ring if every ideal of A is principal. A 
classical example of a principal ideal ring is Z, and the above definition of 
divisibility of ideals was suggested by this example. For a, b in Z, (a)|(6) if and 
only if ab. 


We now prove Dedekind’s famous theorem, namely, that for a number field 
K, if a F (0), (1) is an ideal of Ox, then it factors uniquely as 


a=pil...pe, (3.4) 


into a product of powers of distinct primes pj,...,p,, taken in some fixed 
order. For K = Q, Ox = Z and in Z, every ideal is principal. So if we identify 
the ideal (a) with the integer a > 1, the equation (3.4) becomes 


— fl e 
a=Pp, se eDp 


which says that every integer larger than 1 is a unique product of powers of 
distinct primes, taken in a fixed order. This is the fundamental theorem of 
arithmetic, and equation (3.4) generalizes the fundamental theorem of arith- 
metic to the ring of integers of a number field. 


Z-bases for nonzero ideals 


Let A be a commutative ring with 1. Recall that an ideal a of A is finitely 
generated, if it is finitely generated as an A-module, i.e. if there are finitely 
many elements a@1,...,@, in a such that 


a = (a1) +-:- + (Gn). 


The elements a1,...,@,, are generators of a. 
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Definition 3.40. The ring A is called Noetherian (after the great German 
mathematician Emmy Noether, 1882-1935) if every ideal of A is finitely gen- 
erated. 


Clearly, a principal ideal ring is Noetherian. The following theorem from 
commutative algebra provides an alternative way of defining a Noetherian 
ring. 


Theorem 3.41. Suppose A is a commutative ring with 1. The following are 
equivalent: 


(1) Every non-empty family of ideals of A has a maximal element, 
(2) A is Noetherian, 
(3) Every ascending chain 


a1 Gag Gag C--- 
of ideals is eventually stationary, i.e. there is an m, such that a, = 
an for alln >m. 


Proof. (1) = (2): Let a be an ideal of A and S be the family of finitely 
generated submodules of A contained in a. Clearly, S is not empty. [The zero 
ideal is there.] By (1), let 6 be a maximal element of this family. Then 6 = a for 
otherwise, choosing a in a but not in b, the finitely generated ideal ¢ = b + (a) 
would contradict the maximality of b. 


(2) = (8): By (2), the ideal a = Ua, is finitely generated and all its genera- 
tors are in a,, for a sufficiently large m. Then for all n > m, an = am. 


(3) = (1): Apply Zorn’s Lemma. We leave the details as an exercise. 


We now show that for a number field K, the ring Ox is Noetherian. Actually, 
we have a stronger result, namely, that every nonzero ideal of Ox is a free 
Z-module of maximum possible rank, which is [K : Q]. For this we need the 
following fact. 


Proposition 3.42. If M is a free Z-module of rank n and N is a submodule of 
M, then N is a free Z-module of rank r <n. Moreover, if M = Za,@...®Zan, 
we can write N=ZP, @...@ ZB, with r <n, where 


jai 


and aj; € Z with aj; > 0. 
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Proof. We use induction on n. If n = 0, there is nothing to prove. So let 
n > 1 and assume the theorem to be true for all M of rank < n — 1. Let 
M' =Zaz2®:::-®Zay,, N'=NN M’. 


The set EF = {a € Zl aa, + agag +--+ + Anan € N for some ag,...,Gn 
in Z}, being an ideal of Z, is of the form a1,Z with a1, > 0. Choose 6; = 
44101 +41202+-+-+@1nQ@p in N. Then N’ = {a—aay101|a@ = aay,a1,+--- € N} 
is a submodule of M’. Since M’ is a module of rank n — 1, we can choose 
B; (j = 2) as in equation (3.5). The set {f1,...,6,} is then a required Z-basis 
of N. 


Theorem 3.43. Any nonzero ideala of Ox is a free Z-module of rank [K : Q). 
In particular, Ox is Noetherian. 


Proof. By Proposition 3.42 above, we can certainly write 
a= Zp +++ +26, 
with r <n. All we need to show is that r =n. Let 


OK = Za, O-::@ Zap. 


By Theorem 3.34, choose 0 # a € aN Z. Then aay,...,aa, € a, which 
shows that the Z-rank of a is > n. Hence r = n. 


EXERCISE 
Show that the transition matrix between two Z-bases of a nonzero ideal of 
Ox is unimodular, i.e. it is in GL(n,Z). 


Theorem 3.44. For a nonzero ideal a of Ox, the quotient Ox /a is finite. 


Proof. Choose a #0 in Za. Then aOx € a € Ox. This gives a surjective 
map Ox/aOx — Ox/a. This proves the theorem, because the cardinality 
l\Ox/al < l\Ox~ /aOx| =a". 


Definition 3.45. The norm of a nonzero ideal a of Ox is the positive integer 
N(a) = |Ox/a|. We put N((0)) = 0. 


Remark 3.46. 


1. N(Ox) =1. 
2. If alb, then N(a) < N(b). 
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Theorem 3.47. Let Ox = Za, @...® Za, and for a nonzero ideal a of 
Ox, leta=ZB,8...P8ZCy with B = aes Ajj; (aii > 0). Then N(a) = 


Q11---Ann- 


Proof. One can show by induction on n = [K : Q| that the set {rja;|0 < 
r; <a} forms a complete system of coset representatives of Ox /a. We leave 
the details as an exercise. 


Dedekind domains 


We now define a Dedekind domain and show that 


1. The ring of integers of a number field is a Dedekind domain. 
2. In a Dedekind domain, the unique factorization theorem for ideals 
holds. 


Definition 3.48. Suppose A is a subring of an integral domain B with 1 such 
that 1 € A. The integral closure of A in B is the set of elements of B that 
satisfy a monic polynomial over A. 


Imitating the proof of Theorem 3.7, one can show that the integral closure 
of A in B is a subring of B containing A. 
Definition 3.49. Suppose A is an integral domain and K is its field of frac- 
tions. We say that A is integrally closed if its integral closure in K is A. 

We shall show that if K is a number field, then Ox is integrally closed. 


Definition 3.50. Suppose A is a commutative ring with 1. The Krull dimen- 
sion of A is the supremum of n such that there exist an ascending chain 


(0) =po Spi S...S Pn 
of prime ideals in A. It is denoted by dim(A). 
An important fact in algebraic geometry, that we shall neither need nor 
prove, is the following. 


Theorem 3.51. dim(A[z]) = dim(A) + 1. 


Example 3.52. 
1. If K is a field, dim(r) = 0. 
2. The Krull dimension dim(A) can be infinite. 


3. Since every maximal ideal is prime, dim(A) = 1 © every prime ideal of 
A is maximal. 
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4. The examples of rings A with dim(A) > 1 are provided by alge- 
braic geometry. For example, if & is a field and A = k[a,...,xp], then 
dim(A) = n. For a more interesting example, consider a nonsingular hypersur- 
face X in an n-dimensional space defined as the zeros of a single irreducible 
polynomial f(21,...,%,) € klay,...,%]. Its coordinate ring A = k[|X] = 
k[ai,...,%n]/(f) has Krull dimension n — 1, the dimension of the hypersur- 
face X as a geometric object. The study of an algebraic variety X, i.e. the 
zeros of a set of polynomials, is essentially the same as the study of its coordi- 
nate ring A (defined in the same manner as for a hypersurface). The geometric 
notions can thus be defined purely in ring theoretic terms, invented originally 
to study the arithmetic of number fields. For example, the intuitive notion 
of the dimension of a variety, which is one for curves and two for surfaces, 
coincides with the Krull dimension of its coordinate ring. This leads one to 
consider algebraic geometry as an extension of algebraic number theory. 


Definition 3.53. An integral domain A is a Dedekind domain if 


1. A is Noetherian, 
2. A is integrally closed and 
3. dim(A) = 1. 


Theorem 3.54. The ring Ox, of integers of a number field Kk, is a Dedekind 
domain. 


We have already shown (Theorem 3.43) that for a number field K, its ring of 
integers Ox is Noetherian. The next two theorems show that it is a Dedekind 
domain. 


Theorem 3.55. The ring Ox is integrally closed. 


Proof. We need to show that if a € K satisfies a monic polynomial over 
Ox, then it is in Ox. By Proposition 3.10, Ox [a] is a finitely generated Ox- 
module. Since Ox = Za; +---+ Zan, Z[a] is a finitely generated Z-module. 
By Proposition 3.10 again, a is an algebraic integer and hence is in Ox. 


Theorem 3.56. The ring Ox has Krull dimension 1. 


Proof. First note that a finite domain A = {x1,...,%m} is a field. [To see 
this let 0 A x € A. Then xz1,...,2% are all distinct, hence one of them 
has to be equal to 1.] Now if p is a prime ideal of Ox, then Ox/p is a finite 
domain, which shows that it is a field. But then p has to be maximal. 
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Fractional ideals 


For the rest of this section, we assume that A is a Dedekind domain and 
prove that its ideals have the unique factorization property. We denote by K 
the field of fractions of A. 


Definition 3.57. A subset 6 of K is a fractional ideal of K if b is a nonzero 
A-module and for a nonzero d € A, db C A. 


To avoid confusion, we shall call an ideal a € A an integral ideal. An integral 
ideal is a fractional ideal. [Take d = 1.] If 6 is a fractional ideal, then so is 
d~‘bo for every nonzero d € A. If a,b are fractional ideals, then so are a+ 6 
and ab. Suppose 6 is a fractional ideal. Choose d in A such that a = db © A. 
Then a is an (integral) ideal of A. Therefore, when A = Ox, every nonzero 
fractional ideal is a free Z-module of rank n = [K : Q]. Finally, we prove the 
following theorem of Dedekind. 


Theorem 3.58. Any nonzero ideal a # (1) of a Dedekind domain A is a 
unique product 


— ffl er 
a=p, saa Pp 


of powers of distinct prime ideals, taken in some fixed order. 

For the proof, we need following three lemmas from commutative algebra. 
Remember that A is a Dedekind domain. 
Lemma 3.59. If a prime ideal p divides the product a,...a, of ideals, then 
p divides a; for some j. 

Recall that a|6 if and only if aD 6b. 


Proof. If not, for each 7 we choose an a; in a; but not in p. Then a1...a, € 
a,...a, Gp. Since p is prime, some a; € p. This is a contradiction. 


Lemma 3.60. Suppose A is not a field. Then every nonzero ideal of A con- 
tains a product of prime ideals. 


Proof. Let S be the family of nonzero ideals that fail the theorem. We show 
that S is empty. If not, let a be a maximal element of this family, which 
exists because A is Noetherian. Clearly, a 4 A since, being a domain, A 
certainly contains the prime ideal (0). Also a is not prime, which implies that 
there are a), a2 in A but not in a, such that ajaq € a. Put a; = a+ (a,). 
Then a; 2 a. Hence, aj 2 p,...p, and ag 2 q,...qs. This implies that 


a a),d2 2 fp) ... p,q, ... qs. This is a contradiction. 


Lemma 3.61. In a Dedekind domain A, every nonzero prime ideal p is in- 
vertible, i.e. pq = (1) = A, for some fractional ideal q. 


We call q the inverse ideal of p and denote it by p7?. 
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Proof. Let p~t = {d € K|dp © A}. Clearly, p~! is an A-module containing 
A and p C p'p C A. Since p is maximal, either p~'p = A, in which case 
we are done, or p-!p = p, which we show is impossible. Suppose p~!p = p. 
Then for each x € p-', zp Cp, 2p Cp,...,2"p C p,... Taking M = p in 
Proposition 3.10, we see that x is integral over A. Since A is integrally closed, 
x € A which implies that p~! = A. We show that this is impossible. 


Suppose p-! = A. Choose 0 4 a € p. Then by Lemma 3.60, p 2 aA 2 
p1...p,r. Choose r the smallest possible. By Lemma 3.59, (after renumbering) 
p 2 py. Since dim(A) = 1, p = p,. Put a= po...p,. [If r = 1, then a = A. 
By minimality of r, aA D a. So choose x € a,a ¢ aA. Then pa C aA => xp © 
aAs>atapl Asatte epo!=Ax= 2 € aA, which is a contradiction. 


Proof of Dedekind’s Theorem 


Existence. Suppose S is the set of ideals of A, 4 (0), (1), for which the 
theorem is false. We show that S is empty. If not, choose a maximal element 
a in S. Then a is not prime and hence a g p, for a prime ideal p. Since 
A © pt, we have a € ap7! & pp7! = A. First we show that a S ap~!. By 
taking M = a in Proposition 3.10, the equality ap~! = a implies that every 
x € p' is integral over A. Since A is integrally closed, x € A, which shows 
that p~! = A, ie. p = A, which is not the case. Now by maximality of a, we 
have ap~! = po...p, > a= ppo...p,. This contradicts that a is in S. 


Uniqueness. If a = p1...Pr = qi...qs has two factorizations, we have by 
Lemma 3.59, pi 2 qi. But since dim(A) = 1, p1 = qi. By Lemma 3.61, we 


can cancel p, to get po...py = q2...qs, and so on. This process must end with 
no prime ideal left on either side. 


Remark 3.62. It is clear from the proof of Dedekind’s theorem that the 
inverse of a prime ideal is a fractional ideal. Hence the nonzero fractional 
ideals of kK (K being the quotient field of a Dedekind domain A) form an 
Abelian group under the multiplication of ideals. A is the identity of this 
group. 


Corollary 3.63. Ifa is a fractional ideal of a Dedekind domain, then except 
for ordering the prime ideals, we have a unique factorization 


Mise De 


Here 7 means q~'. For any nonzero ideal a of A, we set a° = (1) = A. 


Let a be a fractional ideal of a Dedekind domain A for a nonzero prime 
ideal p of A, the discrete valuation vp(a) of a at a prime ideal p is defined 
to be the exponent to which it appears in the unique factorization of a into 
powers of distinct prime ideals. It is an integer (positive, negative or zero). 
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For the map vp : K > Z given by vp,(a@) = up((a)), a £0 and vp(0) = ov, 
we have 
1. vup(aB) = vp(a) + p(B), and 
2. if vp(a) F vp(B), then vp(a + 8) = min(vp(Q), vp()). 
Definition 3.64. A map v : K* — Z with properties 1 and 2 is called a 
discrete valuation on K. 


Two discrete valuations on K are equivalent, if they can be scaled to give 
the same valuation. Among all the equivalent valuations, there is a unique 
one which subjects to Z. We will use this normalized valuation to represent 
all the valuations equivalent to it. For each prime p of Ox, vp is a normalized 
discrete valuation on K. 


Definition 3.65. The greatest common divisor (briefly, g.c.d.) of two ideals 


a,b is the ideal 
(a, 6) = TJ pine (avo), 
p 
Definition 3.66. Two ideals a and 6 are coprime if the g.c.d. (a, 6) = (1). 


The least common multiple [a,b] is defined by replacing min with max in 
the definition of the greatest common divisor. 


Let a and b be two ideals of a Dedekind domain A. To say that a divides b 
is equivalent to saying that vp(a) < vp(b) for all prime ideals p. 


EXERCISES 
1. Show that vp is a normalized discrete valuation on a number field 
K. 


2. Ifa, 6 are nonzero ideals of K, show that the g.c.d. (a,b) =a+6. 
Thus a and 6 are coprime © a+ 6 = (1) =A. 


3. Show that a principal ideal domain is a Dedeknd domain. 


4. Show that Z[,/—5] is a Dedekind domain, but not a PID. 


3.6 Ideal Class Group and Class Number 


In the last section, we proved that the nonzero fractional ideals of a Dedekind 
domain A with quotient field K, form an Abelian group J under the operation 
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of multiplication of ideals. The nonzero principal fractional ideals, that is the 
ideals of the form aA = {aala € A} with a £0 in K, form a subgroup P of 
I. The quotient group I/P is called the ideal class group of K. The elements 
of I/P are called the ideal classes. The cardinality of the ideal class group is 
called the class number of kK. We will denote the class number of K by hx. 
In this section we shall show that the class number of a number field is finite, 
in which A = Ox. 


Recall that for a nonzero ideal a of Ox, its norm N(a) is the cardinality of 
the quotient ring Ox /a. We have seen that this cardinality is finite. 


Theorem 3.67. Suppose (a) = aOx is a principal ideal of Ox. Then we 
have 


N((a)) = |Nxjo(@)I- 


Proof. If a = 0 there is nothing to prove, Otherwise, write Ox = Za; ® 
...®Zay, where n = [K : Q). By Proposition 3.42, we can also write ((a)) = 
ZB, ®...@B ZBn, where 

A= > Aji; 
jai 
with a; > 0. By Theorem 3.47, N((@)) = a11...@nn-. On the other hand, 
(a) = Zaa, @-:- 6 Zaan 
which shows that {aa1,...,aa,} is a Z-basis of (a). The transition matrix U 
from {(61,...,8n} to {aai,...,aa,} is unimodular. If for i < 7 we let a,; = 0 


and put M = (a;;), then 


a1 ay 


An An 


Therefore, 


INxa(@)| = |det(UM)] = | det(M)| = |N(aOx)]. 


Corollary 3.68. Ifa¢éN, then N(aOx) =a”. 


Theorem 3.69 (Independence of Valuations). Given distinct prime ideals 
pi,---,pr of a Dedekind domain A and integers a,,...,a, > 0, there is a in 
A, such that vp,(a@) = a;, for allt. 

Proof. Choose a; in p{* ...p@" which is not in pf?.. ey be tee ene pr 
and put a= a, +---+4a,. 
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Theorem 3.70. There is ana in K with prescribed “zeros” and “poles”, i.e. 
given distinct prime ideals p1,...,Pr; i,---,4s of a Dedekind domain A and 
positive integers aj,...,@,; b1,...,b5, there is an a in the quotient field K of 
A, such that vp,(@) = a; and vg, (a) = —b;. 


Proof. Choose 3 and y in A with vp,(8) = ai, vq,(y) = 6; and put a = 
B/y. 


Corollary 3.71. For two ideals a and b of a Dedekind domain A, there is an 
a in A with the g.c.d. (ab,(a@)) =a. 


Proof. If a = pf! ...p%", choose a as in Theorem 3.69. 


Theorem 3.72. Any ideal a of a Dedekind domain A can be generated by 
two elements. 


Proof. It is easy to see that given two ideals a and 6 of A, the g.c.d. (a, 6) = 
a+b. Choose 8 in K, such that Ga! = b is an integral ideal of A. Then 
ab = (3). Now choose a, such that a = (ab, (a@)) = ((8), (a)) = GA+QA. 


Corollary 3.73. For any nonzero ideal a of a Dedekind domain A, the quo- 
tient ring A/a is a principal ideal domain. 


Theorem 3.74. The norm is multiplicative, i.e. if a and b are two integral 
ideals of Ox, then 
N(ab) = N(a)N(b). 


Proof. Let N(a) = r and N(b) = s. Choose coset representatives 
{ai,...,a,-} and {(1,...,8s} of Ox /a and Ox/b, respectively and choose 
a as in Corollary 3.71. We show that {a; + a6;|i=1,...,57=1,...,shisa 
“complete set of coset representatives” of Ox /ab (which means the following). 


1. They are distinct mod ab. If a; + af; = a; + Bm (mod ab), then 
(a; — a1) + a(B; — Bm) € ab Ga. But a € a. Therefore, a;-—a, Ea > i=l=> 
a(8; — Bm) € ab. However, by our choice of a, this implies that 8; — Bm is in 
6, which gives 7 = m. 


2. The set contains a representative of every coset of A/ab. If x is in Ox, 
we can write « = a; +a for some a in a. But (ab,(a@)) = a. Therefore, 
a= Ba+c, c€ ab. Now 6 = 6;+b, b € b for some j. Hence « = aj +ah;+y 
for some y in ab. 


EXERCISE 


Let $8 be a prime ideal of Ox. Show that $M Z = p is a prime ideal of Z. 
(Hence, p = pZ for a unique prime number p.] 
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Theorem 3.75. There are only finitely many ideals of norm less than a given 
constant c. 


Proof. By the multiplicative property of the norm, it is enough to prove the 
theorem for prime ideals $8 of Ox. Given 8, there is a unique prime p in Z, 
such that NZ = pZ. Then P divides (p) = pOx. Therefore, 


N(®P) < |N((p))| =v", 


where n = [K : k]. But given a constant c > 0, there are only finitely many 
primes p with p < c, and given such a prime p, there are only finitely many 
38 that can divide (p). Only primes $8 that divide a p < c can have norm less 
than c. 


Theorem 3.76. There is a constant c = c(K), such that each nonzero ideal 
a © Ox contains an element a #0, with |N(a)| < cN(a). 


Proof. We use the pigeon-hole principle. Write 
Ox = Za, ®-::O Zan 


Let t be the smallest integer > N(a)!/" and look at the (t + 1)” distinct 
elements 
QAyQAy +++ + AnAn 


of Ox, where the integers a; satisfy 0 < a; < t. Since N(a) < ((+1)", among 
them there are two distinct ones which represent the same element of Ox /a. 
We show that their difference a = cja, +---+ Cn is the required nonzero 
element of a. Let Da be the matrix of the linear map “multiplication by a” 
with respect to the basis {a1,...,an} of K/Q. Similarly, let L; be the matrix 
of a;. By linearity of the map a + Ly, we have Ly = cL, +--+ +e,Ly. Since 
|c;| < t, we have 


|N(a)| = | det(Lq)| = | det(e1L1 +--+ +enLn)| < c(ai,...,Qn) max |e;|” 
< ct” < cN(a), 


where the constant c = c(aj,...,Q@,) depends only on aj1,...,@,, and hence 
only on K. 


EXERCISE 
Let K = Q(Vd) be a quadratic field with square-free d 4 0,1. Calculate 


c(a1,...,Qn). 


Theorem 3.77 (Dedekind). The class number hx of a number field K is 
finite. 
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Proof. If K is an ideal class, choose an integral ideal a in K~! and a nonzero 
a in a with |N(a)| < cN(a). Put 6 = (a)a~t, so that ab = (a). Then 


N(a)N(b) = N(ab) = N((a)) = |N(a)| < eN(a). 


This gives N(b) < c which implies that K contains an ideal b of norm < c. 
Since there are only finitely many ideals of bounded norm, there are only 
finitely many ideal classes K. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 
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Arithmetic in Relative Extensions 


Throughout this chapter, K will denote a number field and k a subfield of Kk. 
The extension K/k will be called a relative extension of number fields. We put 
o =O; and O = Ox. 


Theorem 4.1. Ox = {a € K| f(a) = 0 for a monic polynomial f(x) in ola} 

Proof. We only have to show that any a in K which satisfies a monic 
polynomial f(x) in o[{2] is an algebraic integer. Let 

f(z) =a9 +aye+--++an_12" 1! +2" 
with a; in o. Since a; are algebraic integers, 
M = Zlao,a,...,@n-1| 
is a finitely generated Z-module, and so is 
Zag, @1,-.-,@n—1,] = M+ Ma+---+Ma"™?, 


Since Zla] is a submodule of a finitely generated Z-module, Z[a] is also a 
finitely generated Z-module, which shows that a is an algebraic integer. 


Remark 4.2. This theorem allows us to regard K/Q as a special case of the 
relative extension A’/k of number fields with k = Q. 


EXERCISE 
Let & C K C L be number fields with [K : k] = n and [L : K] = m. Let 
O1,-+-,0n be the distinct k-isomorphisms of K into C. Show that each o; 


extends to m distinct k-isomorphisms o;; : L + C. 


Hint: Let L = K(@) and 71,...,7m be the m distinct K-isomorphisms of L 
into C. If we write a in L as 


= a9 +010 +2*> + Oy_10" 
with coefficients in kK, put 
ij (@) = o4(a0) + o4(a1) 7; (9) +--+ + oi(4m—1)T3(0"*). 


Recall the following. 
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Definition 4.3. For a relative extension K/k of number fields of degree n, 
the relative norm 


Nx/k :K +k 
is given by 
N(q@) = 01(@) +++ on(@), 
where 0j,...,0, : K — C are all the k-isomorphisms of K into C. 


The relative trace 
Trek :Kok 


is defined similarly by 


Tre (@) => a1(@) feet On(Q). 


Clearly, 


1. Nx/x (a8) = Nxjx(a@)Nx/x(8) and Nx x(a) = a”, ifaek. 


2. TrK/x(a@ + B) = Trxy, (a) + Tre, (8) and Tr /_(a) = na, if 
ack. 


EXERCISES 


1. Suppose k C Kk C L. Show that 
(a) Nose = Nxje- Nx/x, and 
(b) Trp, = Tre TrosK- 
[Hint: Use Exercise 4.] 


2. Let o be a k-isomorphism of K into C for the relative extension 
K/k of number fields. If 2 is an ideal in O, show that (2) is an 
ideal in the ring O,(4) of integers of a(K). 


3. Let K/k be an extension of number fields and L the normal closure 
of K in C. [Recall that L is the smallest subfield of C containing 
o;(K) for all the k-isomorphisms o1,..., 0 of K into C.] Let 2 be 
an ideal in O = Ox. For each j, 0; (2) is an ideal of o;(O). Let 
01 (Ql) +++ a, (2) denote the ideal of O; generated by the products 
a1-+-Q, with a; in ;(2). We call the ideal 0) (2) ---o, (2) No of 
o the relative norm of the ideal 2 and denote it by Nx 7, (2). 


Show that 
i) Nxje(QB) = Nn (2)Nx/x(B), and 


ii) if a is an ideal in o, then Nx /,(a) =a”. 
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Now suppose that A’/k is an arbitrary but fixed extension of number fields. 
Let o = O; and O = Ox. We know that any nonzero ideal in o is a unique 
product of powers of prime ideals in o and the same is true for O also. However, 
a prime ideal p of o need not generate a prime ideal of O. 


Example 4.4. Ifk =Q, kK = Q(t), then for a=2+4%, 
50 = (a)(@). 


Since N(a) = N(@) = 5 is not a unit in Z, (a) and (@) are both proper ideals, 
each is divisible by a prime ideal of O. Hence, 50 is not a prime ideal of O. 


Suppose p is a prime ideal in o and pO the ideal generated by p in O. Let 
pO = By --- Por (eg 2 1) (4.1) 


be its unique factorization into powers of distinct primes ‘B1,..., 8, in O. We 
call B1,..., PB, the prime divisors of p in Ox, or by abuse of language, in K. 
The positive integers e; = e(%;/p) are the exponents of $B; over p. 


If % is a prime divisor of p in K, we write it as Blp. If PBlp, then PNo = p. 
Definition 4.5. Let |p. Then $ is ramified over p if the exponent e(%/p) > 
1. Further, p is ramified in Kk if a B in K dividing p is ramified, otherwise, 


p is unramified. If $8 is ramified over p, the integer e(B/p) > 2 is called the 
degree of ramification of §8 over p. 


If |p, there is an obvious inclusion, 
o/p > O/B 
of finite fields, taking the coset ++p to +58. The degree of the field extension 
f = F(B/p) = [O/P : o/p] 


is called the residue class degree of $8 over p. 


EXERCISES 


1. Show that Nx /.() = ph PP), 


2. Let k C kK C L be number fields, p a prime in 0, $B a prime in K 
dividing p and Q a prime in L dividing 8. Show that Q|p and 


(a) e(Q/p) = e(2/P)e(B/p) and 
(b) F(Q/p) = FQ/R)F(B/P) 
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Theorem 4.6. Let K/k be an extension of number fields of degree n and p a 
prime ink. Let pO factor in K as in (4.1) and f; = f(B;/p). Then 


g 
y ej fj =n. (4.2) 
j=l 


Proof. It is easy to see that pM Z is a prime ideal of Z. Let pn Z = pZ = (p). 
Then o/p is a finite field of g = p* elements for some d > 1. Hence, Nzjo(P) = 
(q). Taking norm of both sides of (4.1), we have 


Nxjo(pO) = Nx/o(Bi' -- - B57). (4.3) 
But 
Nxjq(PO) = Nija(N/e(PO)) = Nejolp”) = Nesolp))” = (q"). (4-4) 
On the other hand, 
Nxo( Bi" ++ BG?) = Nxjo( Br") --- Nxje(By’)- 
Since, Nxg(B;) = (¢*), 
Neca Bi Bip) = (gehen teate), (4.5) 
Comparing (4.4) and (4.5), we get 
gr = qifit teats, 


This shows that 
n=efit---+egfy. 


Corollary 4.7. A prime p in k cannot have more than n = |K : k] prime 
divisors in k. 


Proof. The inequalities e; > 1, f; > 1 imply g <n. 


Example 4.8. Take k = Q, K = Q(Vd), d 4 0,1 a square-free integer. There 
are only three possibilities for a prime in Q to factor in K. 


1. (p) = p? (ramified) 
2. (p) = pipe with pi F pe (splits) 
3. (p) =p (inert). 


In general, there are quite a few possibilities. 
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Definition 4.9. Let p be a prime of k, [K : k] =n and 

pO =PE BH (6) >) 
with B),...,B, distinct. Then 


1. if e = n (which is so if and only if g = 1, e = e, = n) we say 
that p is totally ramified in K; 

2. if g =n (which is equivalent to e; = f; = 1 for each j) we say 
that p splits completely in kK and 


3. if f=n(Sg=1,e =e; = 1) then p is inert, or stays prime in 
K. 


Remark 4.10. Ifn > 2, there are many other ways for p to factor in kK. The 
way in which a given prime p of k factors in K is a fundamental problem in 
algebraic number theory. 


4.1 Criterion for Ramification 


We now start preparing to show that if K’/k is an extension of number fields, 
the number of primes which ramify in K is finite. In fact, we shall point out 
exactly which primes in k ramify in Kk. 


Definition 4.11. The complementary set of O relative to o is the set 


O'= {a E K| TrK/n(aQ) c o}. 


Theorem 4.12. The complementary set O' is a fractional ideal and contains 


O. 


Proof. It is obvious from the properties of the trace map and definition of 
O’ that O’ is an O-module and that O C O’. All we have to do is to produce 
a nonzero element d of 0 such that dO’ C O. 


Fix a basis aj,...,Q@, of K over k, consisting of elements of O. If a € Kk, 
write 
& = 410, +++-+Gna, (a; € k). 


Then for each 1 = 1,...,n, 


TrK/K (aa) = S04; TrK/K(Qia;) = b; € 0, 
j=l 
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or in the matrix notation 
ay by 
(Trin (aia;)) : => 


Gn bn 


Since the matrix (Tr /,(aia;)) € GL(n, 0), solving by Cramer’s rule, we see 
that da; € 0, where d = det(Trx,(a;a;)) is a nonzero element of o. 


Definition 4.13. The integral ideal 
Dkk = Oe 

is called the different of K/k. 
Definition 4.14. The ideal 0x /;, = Nijx(DK/x) Of 0 is called the discrimi- 
nant of K/k. 

The following is also clear from the proof of Theorem 4.12. 
Theorem 4.15. The ideal 0x /q is generated by the discriminant dx of Kk, 
that 48, 9K/Q = dgZ. 


The rest of the chapter is devoted to prove that a prime §8 of K is ramified 
if and only if |D%,/;,, and a prime p of k is ramified if and only if pld«/,. In 
particular, there are only finitely many primes of Q which ramify in a number 
field kK. These are exactly the primes which appear in the unique factorization 
of dx. 


4.2 Review of Commutative Algebra 


In this section, we recall some basic facts we need from commutative algebra. 
For details, see [3]. 


Localization 


Let A be a integral domain with 1, that is, a commutative ring with 1 such 
that for a,b in A, ab = 0 implies that either a = 0 or b = 0. The ring A is 
contained, in an obvious way, in its field of fractions kK. A subset S' of A isa 
multiplicative set if 


1. 16S but0¢S, and 


2. if 51,52 € S then 5152 € S. 
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sta={% 
Ss 


is a subring of K and contains A as a subring. Every ideal of S~!A is an 
extension of an ideal a of A in the sense that it is generated in S~'A by a, 
thus it is of the form a(S~'A) = S~'a= {4|a €a,s € S}. 


The subset 


acasest 


Theorem 4.16. The prime ideals of S~!A are in one-to-one correspondence 
with prime ideals of A, not intersecting S, via 


poP=S 'poPNA=p. 


If p is a prime ideal of A, then S = A \ p (the set theoretic difference) is 
a multiplicative set and $~1A is called the localization of A at p. The ring 
A, = S~'A is a local ring. [A local ring is a commutative ring with 1 having 
a unique maximal ideal.] The unique maximal ideal of S~'A is S~!p = pAy, 
which is again denoted by p. 


Example 4.17. 
1. Take A = Z and for a fixed prime p, S = {p"|n = 0,1,2,...}. 


Then $-1A = Z | -{2 meZr=01,2,...} 


i 
P 


2. Again let A = Z and p a fixed prime. Suppose p = pZ. The 
localization of A at p is the subring 


Ay = {= Jm,de Z,d > 1 and the g.c.d. (d,p) = 1} 
of Q. 


Theorem 4.18. Suppose A is an integral domain, and m a maximal ideal of 
A. Then the fields A/m and Aw /mMAm are isomorphic. 


Proof. It is easy to see that the map A/m > Am/mA, taking the coset 
a+min A/m toa+mAn of Am/mAm is the required isomorphism. 


Now suppose K/k is an extension of number fields, 0 = Ox, O=Ox,pa 
prime ideal in k, S=o0\p, A= S-10, B= S710 and % = Sp. 


Theorem 4.19. 


i) The integral closure of A in K is B. 
ii) B is a finitely generated A-module. 
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iti) There is an element 7 in p such that every ideal a of A is of the 
forma=p™ = (x™). In particular A is a principal ideal domain. 


Proof. i) and ii) are trivial. For iii), choose a 7 in Ox with vp(m) = 1. 


EXERCISES 


1. Show that B is a semi-local ring (a ring having only finitely many 
maximal ideals). What are these finitely many maximal ideals? 


2. Show that B is a PID, and hence a UFD and a Dedekind domain. 


4.3 Relative Discriminant for Rings 


Let A be a subring of a ring B (all subrings A are assumed to contain 1 of B) 
such that B is a free A-module of rank n. Recall the definition of the trace 
map trg/4: B — A as the trace of the matrix for multiplication map mp by 
b, for b in B. Note that the matrix of the linear map my, depends on the basis 
of B over A, but the trace of this matrix does not. 


Definition 4.20. Let A and B be as above. For a1,...,@, in B, the element 
A(aj,...,Qn) = det(tr(aja;)) 
of A is called the discriminant of the ordered set (a1,...,Qn) relative to B/A. 
Remark 4.21. 
1. The discriminants of two bases of B over A are related by the 
square of a unit of A, and hence are associates. 
2. In general, it is not true that A(a1,...,Q@n) 4 0 if and only if 
Q1,...,Qp are free over A. 
Definition 4.22. The discriminant of B over A is the principal ideal of A 
generated by the discriminant of a basis of B over A. It is denoted by 03/,. 


Let K/k be an extension of number fields with O = Ox and o = Ox. We 
know that O is a finitely generated o-module, but it may not be free (give a 
counter-example). So, in general, we need to modify this definition slightly. 


Definition 4.23. The discriminant 0x), of K/k is the ideal of 0 generated 
by the set of discriminants of all bases of K/k consisting of elements of O. 
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Remark 4.24. 


1. When the class number hx of k is one, all the definitions coin- 
cide, because every finitely generated module over a principal ideal 
domain is free. 


2. 0x«/~ iS a nonzero integral ideal of 0. 


4.4 Direct Product of Rings 


Suppose B,,..., B, are commutative rings with 1. We define their direct prod- 
uct as the Cartesian product B = B, x--- x B,, with addition and multipli- 
cation taken component-wise. Each B; may be regarded as a subring of B via 
the obvious inclusion map, e.g. By 3 6b; > (b1,0,...,0) € B. 


If A is a subring of each B;, then A may be regarded as a subring of the 
direct product B = B, x --- x B,, via the map A 35a — (a,...,a) € B. 


Theorem 4.25. Suppose A with 1 is a subring of each B; and every B; is a 
free A-module of rank nj. Then the direct product B = B, x--- x B, is a free 
module of rank ny +--+: +n,. Moreover 


OB/A = 0B, /A°+* 9B, /A- (4.6) 
Proof. We only need to prove (4.6). To simplify notation, we prove it for 
r = 2. For r > 2, the proof is similar. 


Put ny = m and ng = n. Let aj,...,Q@m be a basis of By, over A and 
B1,---,;Bn be a basis of By over A. As A-modules, if we identify B, and 
By with the submodules B, x {0} and {0} x By of B = By, x Bo, then 


{a1,..-,Qm;81,---,;8n} is a basis of B over A. Moreover, for all i, 7, we have 
ai 8; = 0. Hence A(ay,...,m;$1,.--, Bn) is the determinant of the matrix 
( trp, /a(aa;) | ) 
| trp, /a(Gi8;) 


This shows that 


A(aj,... ,Qm}3 1, - ee Bn) = A(ai,..-,Q@m)A(f1,- ans Bn): 


Therefore, 03/4 = 0p, /A0B,/A- 


Suppose A is a subring of B. Let a be an ideal of A and b = aB be the ideal 
of B generated by a. For a in A and f in B, let @ and £ denote the residue 
class of a in A/a and that of 6 in B/b, respectively. 
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Theorem 4.26. Suppose {61,...,Bn} is a basis of B over A, such that 
{B,,.-.,8n} is a basis of B/b over A/a. Then 


A(B1,--+;Bn) = A(B1,--++Bn)- 


Proof. Suppose 6 € B. If the matrix of the map “multiplication by '” 
with respect to the basis $1,...,8n is (aij) in M(n, A), then the matrix of 
the multiplication by 6 map with respect to the basis 6,,...,6,, is (@,;) in 
M(m,a/a), which shows that 

trp/a(8) = t(B/6)/(4/a) (8). 


Hence, 


A(61,-.-,;8n) = det(tr(6;6;)) = det(tr(8;5;)) SA Bice) 


Theorem 4.27 (Chinese Remainder Theorem). Suppose A is a commutative 
ring with 1 and ay,...,a, are pairwise coprime ideals of A, t.e.a;+aj = A 
fori Aj. Given ai,...,a, in A, there is an element x in A, such that 


t=a,; (mod a;) 

for all j =1,...,r. Moreover, if for another y in A, 
y=a,; (mod a,) 

for all j =1,...,r, then 


x=y (mod a,---a,). 


Proof. Put 6; = [],4;4j;. Then a; + 6; = (1). To prove the theorem, now 
choose 6; in 6; such that 6; = 1 (mod a,), which implies that b;a; = a; 
(mod a;). Clearly, b;a; = 0 (mod a;) if i 4 j. Hence if we put 


t= S- bja;, 
i=l 
then 
x=a,; (mod a;) 
for all j =1,...,n. If also 


y=a; (mod a;) 


for all j, then x — y is in every a,j, hence in a; ---a,. 


Corollary 4.28. If a,,...,a, are pairwise coprime ideals of a commutative 
ring A with 1, anda =a ,---a,, then the quotient ring A/a is isomorphic to 
the direct product A/a, x... x A/a,. 
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Proof. By the Chinese Remainder Theorem, the map 


A/a>xmoda-— (xmodaqj,...,2moda,) € A/a; x --- x A/a, 


is a bijective ring homomorphism. 


Corollary 4.29. If é(m) is the Euler ¢-function, 
go(m) = Card{a]1 <a<m, g.c.d.(a,m) = 1}, 


then 


dee; ) 


Proof. By definition, ¢(m) is the cardinality Card (Z/mZ)* of the group of 


units of the quotient ring Z/mZ. If m = p*, then for 1 <a < m, (a,p) > 1 if 


and only if a = p, 2p, 3p,...,p7~'p. 


Hence ¢(m) = p?—p*1 = p4# (1 - 1). Ifm = pe .-- p#, then by Corollary 
4.28, 


(Z/mZ)* & (Z/pUZ x ... x Z/peZ)* 
= (Z/p@Z)* x... x (Z/peZ)*. 
Therefore, 


o(m) = Card (Z/mZ)” 
= Card (Z/p"Z)* ... Card (Z/p%Z)* 


1 1 
= ph (1- =)... (1--) 
Pl Pr 


-II('-3) 


p\m 


4.5 Nilradical 


Definition 4.30. An element of a commutative ring A with 1 is nilpotent if 
a™ = 0 for some m in Z. 


Theorem 4.31. The set nil(A) of all nilpotent elements of A is an ideal of 
A. 
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The ideal nil(A) is called the nilradical of A. 


Proof. Let x,y € nil(A). Then for some m,n in N, x” = y” = 0. Ifl = m+n, 
then it follows from the Binomial Theorem, that (x + y)! = 0. On the other 
hand, if a € A, then (ax)” = a™x™ = 0. This proves that nil(A) is an ideal 
of A. 


Theorem 4.32. The nilradical, nil(A), is the intersection of all prime ideals 


of A. 


Proof. If x in A is nilpotent, then for some m in N, x” = 0. Hence z € p, 
for all prime ideals p of A. 


Conversely, suppose x is not nilpotent, that is «” 4 0 for all m in N. We 
show that there is at least one prime ideal p such that x ¢ p. Let S' be the set 
of ideals a of A, such that x” ¢ a for all m in N. Clearly, S is not empty, since 
the zero ideal (0) € S. By Zorn’s Lemma, let p be a maximal element of S. We 
shall show that p is prime. If not, then there are x, y in A\p with zy in p. Then 
the ideals a = (p,x) and 6 = (p,y) both properly contain p. By the choice 
of p, for some m,n in N, x™ € a, 2” € b. This shows that 2”t” € ab C p, 
implying p ¢ S. This contradiction proves that p is prime. 


4.6 Reduced Rings 
Definition 4.33. A commutative ring A with 1 is reduced if nil(A) = (0). 


Example 4.34. 


1. An integral domain is reduced. 
2. The product A; x... x A, is reduced if all A; are reduced. 


Theorem 4.35. Suppose K is a number field and B a prime ideal of O = Ox. 
The quotient ring O/38° is reduced if and only if e = 1. 


Proof. If e = 1, then O/ is a field, hence reduced. On the other hand, 
if e > 0, choose 7 in $ — ¥%?. Then 7 4 0 in O/P*, but 7° = 0 in O/PE°. 
Therefore, O/8° is not reduced. 


Now let A be a subring of a commutative ring B, both with 1. Suppose B 
is a free A-module of rank n. 
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Theorem 4.36. [f A is a finite field, then B is reduced if and only if 0B/4 # 
(0). 


Proof. First suppose that B is not reduced, that is, it has a nilpotent element 
a #0. A being a field, a can be completed into a basis a; = a, Q2,...,Qp, of 
the vector space B over A. Now all the elements aja;, 7 = 1,...,n are also 
nilpotent and since the matrix for a nilpotent element is also nilpotent, its 
trace is zero (why?). Hence the first row of the matrix (tr(a1a,;)) consists of 
zeros only, which shows that 


A(aj,...,Qn) = det(tr(aj;a;)) = 0. 
Therefore, 0574 = (0). 


Conversely, suppose B is reduced, i.e. nil(B) = {0}. Since nil(B) is the 
intersection of all prime ideals and B is finite, 


(0) = Pin... NB, Bi AP; for i Fj). 


Every B/B,, being a finite integral domain, is a field, hence all $B; are maximal 
and therefore coprime in pairs, and 


Bin... AB, = Pi... Be. 
By the Corollary 4.28, 
B= B/(0) = B/®,---B, = B/P, x... x B/P,. 


By Theorem 4.25, 
9B/A = 9(B/1)/A ++ 9(B/B,)/A- 
Since A is a field, each 0(8/,)/a # (0). Hence p/4 4 (0). 


4.7 Discriminant and Ramification 
Finally we arrive at the main result of this chapter, a criterion for ramification, 


proved by Dedekind in 1882. 


Theorem 4.37. Suppose K/k is an extension of degree n of number fields. 
A prime p of k ramifies in K if and only if p|0K/r- 


Proof. Let 
pO =i... 
be the factorization of p into powers of distinct primes in O. Since the ring 


O/pO = O/PE ... PS ¥ O/PL x... x O/BG, 
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p is ramified = some e; > 1 = O/; is not reduced = O/pO is not reduced 
 0(0/po)/(o/p) = (0). Thus we need to show that 0(0/po)/(o/p) = (0) © 
ple. 

Let S = 0\p, A the localization of o at p, B = S~10, YP = S~'p, the 
maximal ideal of A. Since O is a finitely generated (but not necessarily free) 
o-module, B is a finitely generated A-module, generated by the same elements. 
Now since A is a principal ideal domain, B has a basis over A, easily seen to 
consist of n = [kK : k] elements a1,...,@n. Since S does not intersect any of 
the prime ideals of O lying above p, we have the following diagram: 


O/pO ~ B/BB 
| | 
ofp = A/P 


For 6 in B, we denote by £ its residue class in B/S$8B. The dimension 
of O/pO over o/p is n and so is the dimension of B/$BB over A/P. Since 
@1,..-,@n, generate B/38B over A/B, by comparing dimensions, they must 
form a basis of B/3$8B over A/38. Thus by Theorem 4.26 and the diagram 
above, 0(@/po)/(o/p) = (0) if and only if A(ai,...,a,) = 0. Thus we show 
that p|0%/, if and only if A(ay,...,an) € PB. 

First, let A(ay,...,Qn) € B. If {f1,..., Bn} is a basis of K over k consisting 
of elements in O, then 


B; = ye Ajj; (aiy € A) 
j=l 


which shows that A(S1,...,8,) = det(Tr(a;a;)) - (det(a;;))? € ON P = p. 
Hence, 0x /, © p, ie. pl0xK/,~- Conversely, suppose pldx 7x. If a1,...,an is a 
basis of B over A, write each a; = 6;/s with 8; in O and s in S. Then 


Aoi Gator) = a det (tr(3,8;)) 


) 


Geometry of Numbers 


For a commutative ring A with 1, we denote by A” its group of units, that is 
A*X = {u € A| vu =1 for some v in A}. In this chapter, we shall show that 
the group O; of units of a number field K is finitely generated. To motivate, 
let us take a square-free integer m > 1. For the sake of simplicity, let m = 2,3 
(mod 4), because then for K = Q(./m), Ox = Zi,/m] = Z@ Z./m. Now 
u=x+y/me Of if and only if the norm 


N(u) = 2? — my? = +1. (5.1) 


Thus in the simplest case of the quadratic field K = Q(./m), the determina- 
tion of OX is equivalent to solving the Pell equation (5.1). 


5.1 Lattices in R” 


If a € R” and r > 0, we call the subset 


B,(a) = {@ € R"| dist(a, a) = ||a — al| <r} C R” 


the open ball of radius r, centered at a. A subset X C R” is discrete if for 
each a in X, there is an r > 0, such that X 7 B,(a) = {a}. Consider R” as an 
Abelian group under addition. A lattice in R” is a discrete subgroup L 4 {0} 
of R”. Let d be the dimension of the subspace of R” spanned by elements of 
a lattice L C R”. Clearly, d <n. We call d the rank of the lattice L. A lattice 
LCR” is a full lattice if its rank is n. 


Remark 5.1. Topologically speaking, LZ C R” is a full lattice if and only if 
the quotient space R”/L is compact. 


EXERCISE 


Show that DL C R” is a lattice if and only if it is a Z-module 


L=2Zv,6...@Zvq (5.2) 
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for some vectors V1,...,Uq in L. The expression (5.2) means that each v in L 
has a unique representation 


Vv =ajv, +:::+aqva 


with ay € Z. 


Hint: If d = 1, choose v1 # 0, a vector in L nearest to 0. This is possible, 
because L is discrete. Then, clearly every vector v in DL has a unique represen- 
tation v = av, for a in Z, for if v = (a+r)v, withO <r <1, then rv; € L, 
contradicting the choice of v,. For d > 1, use induction on d. 


We now give a characterization for a lattice to be full, which is more suitable 


for our purpose. 


Theorem 5.2. A lattice L C R” is full if and only if there is a bounded set 
Y CR” such that 
R” = Usger(v + Y). (5.3) 


Here, v+ Y = {v+y|y € Y}. Before proving the proposition, we define a 
useful term. 


Definition 5.3. Let 
L=Zv,6...@Zv, (5.4) 
be a full lattice in R”. The set 


P= {ey +++ ¢,u,|0< c < 1} (5.5) 


is called a fundamental parallelepiped of L. It depends on the Z-basis 
{vi,...,Un} of L. Clearly P is bounded and 


R” = Uver(v+ P), (5.6) 
a disjoint union of translates v + P of P by elements of L. 


Proof. If L is full, we can take Y to be a fundamental parallelepiped of L. 


Conversely, suppose a bounded set Y C R” exists with the property (5.3) 
and L is not full. We show that this leads to a contradiction. 


Let W be the subspace of R” spanned by the vectors in L. Then d = 
dim W <n. Consider R” as an inner product space with the dot product of 
vectors. Choose a unit vector vay1 (by the Gram-Schmidt Process) which is 
perpendicular to every vector of W. Let r > 0 such that Y C B,.(0). It is easy 
to see that if w = avai, is a vector in R” with a > r, then w ¢ Upyet(U +Y). 
This is a contradiction. 
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5.2 Minkowski’s Lemma on Convex Bodies 


The Dirichlet’s unit theorem asserts that, up to the roots of unity in K, the 
group O; of units of K is a free Abelian group of rank r = r; + ro — 1. [We 
shall define the non-negative integers 7; and rz in the next section.] It is not 
very difficult to show that r < ry +r2g—1. The harder part that r = ry +r2—-1 
follows from the famous lemma of Minkowski on convex bodies. 


A subset X C R” is convex if for all u,v in X and all real ¢ in the interval 
(0, 1], the vector tu + (1 — t)v is in X. That is, the line segment joining u to 
v is entirely in X. It is easy to see that if X is convex in R™ and Y is convex 
in R”, then X x Y is convex in R™T”. We call X C R” centrally symmetric 
if v © X implies —v € X. 


Let ys be the Lebesgue measure on R”, that is, the measure on R”, such that 
for a cube X C R” given by 


n 


p(X) = vol(X) = ]] (0; — a5). 


j=1 


Let L be a full lattice with a fundamental parallelepiped P, as in (5.4) and 
(5.5). Of course, P depends on the choice of the Z-basis {v1,...,Un} of L. 
However, any two Z-bases of L are related by a unimodular matria, that is a 
matrix of determinant +1 with entries in Z. Since y(P) is the absolute value 
of the determinant, whose rows are V1,..., Un, it follows that the volume p(P) 
of P is independent of the choice of the basis. Thus, we may denote u(P) also 


by (ZL). 


Theorem 5.4 (Minkowski’s Lemma). Suppose X C R” is a bounded, cen- 
trally symmetric convex set and L C R” is a full lattice. If u(X) > 2" u(L), 
then X contains a nonzero vector of L. 


Proof. First we show that if Y C R” is a bounded set, such that {v+Y|v € 
L} is a family of disjoint subsets of R”, then w(Y) < u(P), where P is a 
fundamental parallelepiped of L. This is almost immediate, because writing 
Y as the disjoint union 


Y=UyerY 1 (ut P), 
we have by (5.6), u(Y) = ye, MY A (vu + P)). 


Since yp is translation invariant, w(Y N (v + P)) = u((—v + Y)N P). Hence 
MWY) = Vyepu((-v + Y)NP) < w(P), because the sets —v + Y are also 
pairwise disjoint. 
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Take now Y = 4X = {$v|veE X}. It is given that w(Y) = #- u(X) > 
u(P). Hence the translates v + Y, v € L of Y are not pairwise disjoint, that 
is, there are two distinct vectors v,,v2g in LD, and uj,,u2 in X, such that 
vt 4 Uy = vot ; U2. Since X is centrally symmetric and convex, this shows 
that vy — v2 = 5 U2 5 Uy X. Since v1 — V2 is a nonzero element of L, we 
are done. 


Remark 5.5. In Minkowski’s Lemma, the hypothesis u(X) > 2”u(L) may 
be replaced by u(X) > 2" (ZL), if X is compact. 


Corollary 5.6. Suppose Y C R” is a measurable set. If Uver(vu + Y) = R”, 
then u(Y) > y(P). 


5.3 Logarithmic Embedding 


Suppose Kk C C is a number field of degree n over Q. Consider a ring homo- 
morphism o : K — C. We require that o(1) = 1. Hence ojg = 1g, the identity 
map on Q. Such a a is clearly injective. [Its kernel Ker(c) is an ideal of the 
field K, which can only be {0} or K.] Hence, we call a a Q-isomorphism of 
K into C. There are exactly n Q-isomorphisms of K into C. To see this, write 
K = Q(a). If o is a Qisomorphism of K into C, it is determined by o(a), 
which is a conjugate of a. But there are exactly n conjugates of a over Q. 


One may regard such ao : K — C also an injective linear transformation 
of vector spaces, when K and C are viewed as vector spaces over Q. Unless 
stated to the contrary o : kK — C will be a Q-isomorphism. 


If o(K) CR, we call o a real imbedding, otherwise it is a complex imbedding. 
If o is complex, the map o : K — C, given by a(x#) = a(x) is also a Q@ 
isomorphism. Thus, the complex Q-isomorphisms occur in pairs. We shall 
denote the real Q-isomorphisms of Kk into C by o1,...o0,, and the re pairs of 
complex ones by 07,41, 07,413 --+3%r,4+ro:r,4ro- In particular, n = 71 + 27o. 


Consider C as a vector space of dimension two over R with {1,7} as the 
standard basis. If z = 7 + iy € C, the multiplication by z is a linear transfor- 
mation of C into itself over R. Its matrix relative to the basis {1,7} is easily 


seen to be T = @ ) with determinant 


det(T) =a? + y? = |2|?. (5.7) 


If we identify C, as a vector space over R with R?, via the map + +iy > (i 
then R™ x C™ = R”. 
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For a fixed vector a = (%,...,Up,} 21,---;2r,) of V = R™ x C”™, the map 


_— / Tol / f Te / / 
B= (iseses Bea) Cis lg ii ee) 


defines an R-linear map from V to itself. The determinant of its matrix, which 
we also denote by a, is 


det(@) = @1 +++ 2p, |z1|? «++ |zr9|?, (5.8) 
in view of (5.7). 
We now define a Q-linear map 
p: kK >~V=R" xC”? =R" 


by 
P(Q) = (01(Q), «++ Ory (@)j Or 41(@), +» + Orr tr2(@)): 
It is clear from (5.8) that 


det(p(a)) = N(a), (5.9) 
where N is the norm Nyx/g, as defined in Chapter 3. 


Theorem 5.7. If a1,...,Q@y is a basis of K over Q, then {p(az),..., p(@n)} 
is a basis of R” over R. In particular, p(Ox) is a full lattice in R”. 


Proof. All we need to do is show that the determinant 


(1) (1). Q) ; a) (1) 
14 


ry 1 Ory ry 41 Yr t+ Poe T1i+T2 ryt+re 
d= #0, 

aa al My ales Mn 

where 
tid av ifl<r<ry 
Or\As) = é 
—* a 4iy) ifry<r<rytro. 

For ry <r Sritre, §” = 3 (op(as)+Fr(as)) and ys” = # (0,(as)—F-(as)). 


Substituting this in the above determinant and performing the obvious column 
operations, one gets 


, o1(Q1)---Or,(Q1) Or, 41(A1) Fr, +1(@1) 
d = ———_— : 
(—21)"2 = 
Oil ig a ory (Qn) Or, 41(An) Or 41 (ln) 
1 
ees) ee 
(—2i)” (a1, »a ) 
The determinant D(a1,...,@,,) is related to the discriminant d* of the basis 


Q41,.--,Qn of K over Q by d* = D?(a4,...,an). Since d* 4 0, then likewise 
dH: 
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Corollary 5.8. The volume of the fundamental parallelepiped of the full lattice 
p(Ox) is sy dk, dx being the discriminant of the number field K. 


Proof. If Ox = Za, ®---® Zan, then dx = D?(a1,...,Qn). If P is the 
fundamental parallelepiped of p(O;), then the volume p(P) of P is, up to 
sign, the determinant whose rows are p(a1),..., (Qn). Hence 


1 1 
u(P) — gry | (a1, -+-An)| = Bra VJ |dx\. 


We now define a map A: K* — R"*", called the logarithmic imbedding 
of K* in R™+" by 


Ma) = (log |or(a)],.-- Jog Jor, (@)|; log lor, 41(@)|?,--- slog |ors +2 (@)|?). 


[Here log is the natural logarithm to the base e.] Clearly A(@B) = \(a) + (8), 
hence 2 is a group homomorphism from the multiplicative group K™ to the 
additive group R™*"2, 


Theorem 5.9. \(O;) is a lattice in R™*". 


Proof. We only need to show that the subgroup \(Oj) of the additive group 
R™+"2 is discrete. For t > 0, consider the subset 
X = {(log|oi(a)],-.- log lor, (a)|; log lor 41(a)|*,.--, 


log |or, +ro(@) |?) a € K*} 9 By(0) 
of R™+"2, For a vector in X, we then have 


log |a;(a)|<tifl<j<n 


and 
log |oj(a)|? <tifri<j<ritre. 
This means 
loj(a)|<e ifl<j<n 
and 
|o;(a)| < et/? if ry <jg<rmt+re. 


Since, by Theorem 5.7, p(Ox) is a lattice in R”, there are only finitely many 
a in OF with X(a) in X. 


Theorem 5.10. The kernel, ker(X) of the restriction map, also denoted by 
A: OF 4 Rt"? is a finite group, and consists of precisely the roots of unity 
in Kk. 
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Remark 5.11. We denote the group of roots of unity in K by Wx. The 
theorem asserts that ker(A) = {a € OF| A(a) = 0} is the finite group Wx. 


Proof. If ¢ € Wx, then o(¢) € Wx for every Q-isomorphism o of K into 
C, hence |o(¢)| = 1. Therefore, \(¢) = 0, which shows that Wx C ker(,). 
Conversely, every component of the vector p(q@) for @ in ker(X) has absolute 
value 1. Since p is injective and p(O;) is discrete, ker(A) is finite. Hence for 
a in ker(A), the powers a,a?,a°,... of a repeat, ie. for some t > s, at =a’, 
which shows that a” = 1 form =t— s. Hence ker(A) C Wr. 


Theorem 5.12 (Dirichlet 1846). Let r =r) +r2—1. The group of units OF 
is isomorphic to Wx x Z". In other words, there are r units uj,...,Ur in OF 
such that every unit u € oe has a unique representation 


U= Cup +--uee 


with ¢ in Wy and a; in Z. 


Dirichlet’s unit theorem, Theorem 5.12, now follows at once from Theo- 
rem 5.10 and the first isomorphism theorem in group theory, if we prove the 
following fact. 


Theorem 5.13. The rank r of the lattice \\OX) is given by r =1ry +r2—1. 


Proof. The units u of Ox are characterized by N(u) = [[, o(u) = +1, 
where the product is over all Q-isomorphisms of K into C. Therefore for 
a € OX, log oy (a) +: +-+log lor, (a)|-Hog lor, 41(a)[2-+-+ Hog [ors 4ra(a)[? = 
log lor (a) Ory (Q)-Or, 41(@) Or, 41(@) “' Ori+re (Q)-Or4r. (a)| = log |N (a) 7a 
0. Hence A(O;Z) is contained in the hyperplane H of R™*" defined by the 
equation 


ey ta =O: 
Since dim H =r; + rz — 1, the rank of (Of) < ri t7r2-1. 


We use Minkowski’s Lemma to show that r =r, +r2—1, that is, \(OZ) is 
a full lattice in H. 


For @& = (@1,..-,%r,321,-++;2ry) nn V = R™ x C™, we define its norm N(a@) 
by 


N(q@) = a1 +++ tp, 21)? +++ [2ral?. 


Let S be the subset of V of elements a with |N(@)| = 1. The map1: S > 
R™+"2 defined by 


i(a) = (log |a1|,..., log |x, |; log |z1|?, ..., log ele) 


has the following properties. 
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1. The image l(S) of S is the hyperplane H C R™*" given by the 
equation 
Arte: +A tr, = 0. 


2. The image l(X) of a bounded set X C S is also bounded. 
(1) is obvious. To prove (2), let |x;|,|zj|? < C for all 7. Then the coordinates 
of points in I(X) satisfy the inequalities log |x;|, log |z;|? < logC, and hence 


are bounded above. On the other hand, for @ = (%,...,%p,321,--+,2Zrg) im 
X CS, |N(a@)| = 1, hence 


T1 r2 
S “log |ar5| ~ S/ log |z;|? = 0. 
j=l j=l 


This gives 
ives -( Dees + Desist") 
j#i 
> —(ry +12 — 1) logC 
and 


log |2i|? = -( Dee Jey] + Ye les?) 
j#i 
> -(ry +172 — 1) log C. 
Hence the coordinates of a point in /(X) are also bounded below. Hence /(X) 
is bounded. 


To prove that \(O;) = l(p(Ox)) is a full lattice in the hyperplane H, by 
Theorem 5.2, it is enough to find a bounded set in H whose translates by the 
elements of \(O%) cover H. Since \(S) = H and for X C S, l(p(Oz)- X) = 
l(p(Oj-)) +1(X), it suffices to find a bounded set Y C S, such that 


3 =Y - p(OX) = {y- plu)ly in ¥,u € OX}. (5.10) 


[Here the product y - p(u) is componentwise in V = R™ x C™.] We take for 
a>0 
X = {(1,..., 27,3 21,-+-52rg) € S| |w;| and |z,;| < a} 


It is easy to see that its volume 


p(X) = fo fates dey, / bts [tedin= (2a)" (1a?)"? = 2" n"™a”. 


—a lZrgl<a_ |zi|<a 
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To apply Minkowski’s Lemma, we choose a large enough so that u(X) is big- 
ger than 2” times the volume of the fundamental parallelepiped of p(O%). We 
already computed (see Corollary 5.8) the volume of the fundamental paral- 
lelepiped to be 545 \/|dx|. Hence our choice of a is such that 


p(X) > 2”. — V/|dxl, 
i.e. 


ay 
n> ay om 


To find Y for which (5.10) holds, let # € S. We replace the full lattice 
L = p(Ox) CV = R" x C™ by M = a-L. Since the map, y > x«-y 
(componentwise multiplication in V) is an invertible linear map, M is also 
a full lattice in V. Moreover the determinant of this linear map is equal to 
N(a) = +1, which implies that the volume of the fundamental parallelepiped 
is invariant under this map. Therefore, the volume of the fundamental paral- 
lelepiped of xM is also aE \/|\dx|. By Minkowski’s Lemma, applied to the set 
X and the lattice M = xp(Ox), we find a 40 in Ox such that y = a - p(a) 
is a nonzero element of X. Since N(a) = +1, 


e=iltee) (o(a)| =|N(y)| <a”. 


[The product [], o(@) is over all the Q-isomorphisms o of K into C.] In 
Chapter 3, we proved that there are only finitely many ideals of bounded 
norm. In particular, there are only finitely many principal ideals of bounded 
norm. Since the norm N((q)) of a principal ideal is the same as the absolute 
value of the norm N(a) = [[, o(a) of a, this implies that, up to multiplication 
by units of Ox, there are only finitely many a in Ox with |N(a)| < a”. Call 
them a1,...,Qm. Therefore, a = wa; for some j (1 < 7 < m) and hence 


y = xp(a) = xp(u)p(a;), ie. & = p(u-")p(az")y. 


Since p(a; '\y and # both have norm +1, pla; *)y e Sn p(a5*)X, which 
shows that ze OF) (SN pla; 1)X) for some j (1 < j < m). 


Now we put 
Y =U%%1($ 1 p(ajz*)X). 


Since a and a, were independent of x, every element x of S' is in p(OZ)-Y. 
Since each pla; *)X is bounded, so is Y. This proves (5.10), concluding the 
proof of Dirichlet’s unit theorem. 


Remark 5.14. In Dirichlet’s unit theorem, the units u1,...,u,;, which gen- 
erate the free part of the group Oj, are called fundamental units. The set 
{u1,...,u,} of fundamental units is not unique. What is unique is the cardi- 
nality r of this set. We call r the rank of OF. 
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5.4 Units of a Quadratic Field 


Let K = Q(Vd) (d 4 0,1, a square-free integer) be a quadratic field. We call 
K a real quadratic field or an imaginary quadratic field according as d > 0 
or d < 0. If K is an imaginary quadratic field, then r; = 0, rg = 1, so 
r=ryt+r2—1=0. In this case, OF = Wx, the roots of unity in kK. We leave 
it as an exercise to determine this finite group Wx. 


For the real quadratic field, r = 1 and the group of units is given by the 


following corollary. 


Corollary 5.15. If d> 1 is a square-free integer and K = Q(Vd), then the 
group 


oF — +1} x Z. 
In particular, the Pell equation x? — dy? = 1 has infinitely many solutions in 


integers. 


EXERCISES 


1. Determine the structure of Of when [K : Q] = 3 and 4. 


2. Use Dirichlet’s unit theorem to find all integer solutions of 5a? — 
hye Sy 


5.5 Estimates on the Discriminant 


In view of Dedekind’s theorem on ramification, it is clear that the discrimi- 
nant is the most important invariant of the ground field & for studying the 
ramification of primes of k in a finite extension K/k of number fields. Thus 
a minimal knowledge of this invariant is absolutely essential for studying the 
arithmetic in such extensions. In this section, we obtain some classical results 
on the discriminant dx for K/Q. 


For non-negative integers d and m, let n = d+2m. We put V = R¢ x C”, 
which when viewed as a vector space over R has dimension n. The Lebesgue 
measure on V will be denoted by yu. For n = 1, it is length, for n = 2, it is 
area and for n = 3, it is volume, etc. We identify C with R?, via the map 
CSz=x2+iy- (x,y) € R. For areal number a > 0, we put 


d m 
Sa(d,m) = {(x1,..-,; a} Z1,---;2m) € V | S lag) +250 |z5| <a}. 
j=l j=l 
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Clearly, S.(d,m) is a compact, convex, measurable and centrally symmetric 
subset of R”. 


Theorem 5.16. 


n 


(Sa(d,m)) =24(2)". 


a) tags (5.11) 


Proof. We use double induction on d and m. We have 


“(Sa(1,0)) = u([-a, a]) = 2a, 


which agrees with (5.11). Next, 


which also agrees with (5.11). 


To compute ju(S,(d+1,m)), we integrate last with respect to the (d+ 1)!” 
variable x, and use the symmetry. So by the induction hypothesis, 


= pdt (5)" qg@tl 
2) (n+) 


Because n = d+ 2m, if (5.11) is true for d, then it is true for d+ 1 also. 


To compute p(S_(d,m + 1)), note that S,(d,m + 1) is defined by 


d m 
So las] +250 zl + 2lzl <a. 
j=l j=l 


If we fix the (m+ 1) variable until the end, by the induction hypothesis, 


a — 2|z|)” 


U(Sq—2)2|(d,m)) = 94 (5)" ( 


n! 


Finally, integrating with respect to the last variable z, 


n! 


w(Sa(dm+1))=29(S)" f(a —2Ia))raulz) 


lzI<$ 


68 Geometry of Numbers 


We put z= re”, 0<r< $,0<6< 20. Then 


) (a — 2|z|)"du(z) = / [io—2»yrrarao 
0 0 


lzI<$ 


- an f (a — 2r)"rdr 
0 


T qnt? 


2° (n+1)(n +2)’ 


Hence, 


a\m+tl qé+2(m+1) 
1(Sa(d,m + 1)) = 24 (2) 4 


2 d+2(m+1))!" 
This completes the proof by double induction. 


Now let kK be a number field of degree n = 71 +2rg. Let o1,...,0r, be the ri 
real Q-isomorphisms of K into R, whereas 0,41, 07,413 ---3 Or; tre; ry +r. the 
rg pairs of complex Q-isomorphisms of K into C. In particular, n = r, + 2re. 


Again, let 
p:K>V=R" xC”? SR” 
be the canonical imbedding of K in V. It is given by 


p(a) = (o1(a), sey Ory (a); Or, +41(@), +++ Oritre (a)). 


We shall need the following fact. 


Theorem 5.17. For non-negative real numbers 11,...,2n, show that 


Up sR ek Ba 
W/L ++ Ly < 


n 


i.e. the geometric mean never exceeds the arithmetic mean. 


Proof. Use the Lagrange method of multipliers to maximize the function 
f(a1,...,2n) = (£1...2n)'/" subject to the condition 7] +--- +a, =c,a 
constant. 


Theorem 5.18. Suppose K is a number field with [K :Q) =n =11 4 2re. If 
a # (0) ts an integral ideal in K, then a contains a nonzero element a, such 
that 


|IN(a)| < (2) m ldx| - N(a). 
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Proof. Recall that for a lattice, 
L=Zv,0...0Zvy, 
in R”, the volume p(P) of its fundamental parallelepiped 
P= {avi +--+: +anv7|0 < a; < 1} 


is independent of the Z-basis {v1,...,Un} of L. Hence we may denote it by 
p(L) and call it the volume of L. If M C L is a full sublattice, of index 
[L : M] = m, then a fundamental parallelepiped of M is a disjoint union 
of m translates of a fundamental parallelepiped of DL. Since py is translation 
invariant, 


u(M) = mp(L). 
In particular, if a is a nonzero (integral) ideal of O = Ox, then 


(o(a)) = H((0)) -N(a) = side] (a. 


Now choose a > 0 such that 


T\rT2 a” 
a (FY <= w(Sal(r,r2)) 
n! 
= 2" u(0(0)) N(a@) 
= 2" 2° \/|d| N(a), 
1.€. 
ANT 
"=n! (= dx| N(a). 
want (2) Vide 
By Minkowski’s Lemma for (compact) convex sets, there is a nonzero element 
a of a, with p(a) € Sa(ri,r2). Hence, by Theorem 5.17, 


N(a) =|] ola) < = (Siew <f_m (-) * /ldxl N(a). 


(Again, here the product and the sum are taken over all Q-isomorphisms of 
K into C.] 


Corollary 5.19. Every ideal class contains an integral ideal a with 
A\" nl! 
N(a) < (=) 7m Vidkl: 


Proof. Take an ideal 6 in the given ideal class. Multiplying by a principal 
ideal, we may assume that 6~! is integral. By Theorem 5.18, choose a in b~! 


such that sad 
4\"? n! 

<(-=}] —V|dx|-N(67'). 12 

mol s (4) 3 vided v6) (5.12) 
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Clearly, a = (q@)6 is an integral ideal, and by (5.12), 


TT 


N(a) = N((a)6) = |N(@)iN() < (2) Vel 


Definition 5.20. The constant 
A\" nl! 
C(K) = (=) ae 
T n 
is called Minkowski’s constant for kK. Note that C(K) > 0 as n— ow. 


Example 5.21. For K = Q(V—5), Ox = Z/V—5], hence 
2 


dx = ; _f=5| = 20. 
Here n = 2, r; = 0, rg = 1. Therefore, Minkowski’s constant is given by 
4 2 2 
C(K) = =-. 


qn 22 


EXERCISE 


Show that for all integers n > 1, 
AN Tah nx (3n\""* 
An = | — —) >a,=5 (— : 
(=) Ge) 20-5 G) 
Hint: Use induction on n to show that for all n > 2, 4" <1. 
Theorem 5.22. If K is a number field with [K :Q|)=n> 1, then 


n-1 
nt (30 

dx| > —{ — ; 
axl > 5 (F 


In particular, the degree |K : Q|) < Clogdx, where C > 0 is an absolute 
constant. 


Proof. If a 4 (0) is integral, then N(a) > 1. Hence by Corollary 5.19, 


m\2r2 (nr\? 
lax > (J) (3). 
= 


n 
(2)", and hence by the Exercise 


a) 


But | < land 2r2 < n. Therefore, (a 


above, 
2 
T\n ne T 
= . > 
idx > (4) (=) — 3 
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Theorem 5.23 (Minkowski 1891). Let dx be the discriminant of the number 
field K. Then dx =1 if and only if kK =Q. 


Proof. It suffices to show that if [K : Q] =n > 1, then |dx| > 1. This is 
obvious by Theorem 5.22, because % > 1 and 27 > 1. 


We need the following proposition to prove another important theorem, due 
to Hermite, namely, that there are only finitely many number fields of a given 
discriminant. 


Proposition 5.24. Given an integer n > 1 and a constant c > 1, there are 
only finitely many algebraic integers in C of degree n over Q, such that all the 
conjugates of a (including a itself) are bounded by c. 


Proof. If a is an algebraic integer, then so are its conjugates. In fact, they 
all satisfy the same (irreducible) monic polynomial f(x) in Z[x]. Further, if 
Q| = 4, Q2,...,Qp are all the conjugates of a, then 


= 2" — (ay ++: +a, )a™! + (aiag +++ +g? ? — + + (—1)" a... cy. 


It follows that the integer coefficients a; of f(x) are symmetric functions of 
Q1,---,Q@p, and satisfy 


Jaj| < (‘) oF Shc = (2c). 
This bound is independent of a and depends only on c and n. Therefore, there 


are only finitely many possibilities for the integer coefficients a; of f(x), and 
hence for a. 


EXERCISE 


Suppose K is a number field with [kK : Q] = n. Let o1,...,0n be all the 
Q-isomorphisms of Kk into C. If for some a in K, o1(a) #.0;(a) for all 7 £1, 
show that the degree of a over Q is n. 


Theorem 5.25 (Hermite). There are only finitely many number fields of a 
given discriminant. 


Proof. In view of Theorem 5.22, it is enough to show that there are only 
finitely many number fields K with fixed degree [K : Q] =n =r + 2r2 and 
of discriminant dx = d # 0. Depending on whether 1, is zero or positive, we 
first define a centrally symmetric compact subset S of R” ~ R™ x C™, and 
compute its volume. 
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1. Ifr, =0, ie. if K is totally imaginary, S' is the following product 
of one rectangle in C and rz — 1 disks, each of radius 1/2. 


2 rg—-1 
S = {(z1,..-,2r,) € C™| [Re(z1)| < 2"? (2) V|dx|, 


\Im(z1)| < 1/2, |z;| < 1/2 for 7 = 2,...,ro}. 


Then 


nonvr() wea(e(i)) 
=. 9-72 / idx] 


= 2"u(p0). 
2. Ifry, > 0, then 
De 
S = {(@1,..., 2p} 21,---52r5) | [eal < gr-l (2) \dx|, 


|x;| << 1/2 for 7 =2,...,71 and |z;| < 1/2 for 7 =1,..., ro}. 


won (2)" (C8) 


= 2" (pO). 


Again, 


i) 


Hence, by Minkowski’s Lemma on (compact) convex sets, O contains a nonzero 
element a with p(q) in S. 


Now 
T1 rytre 
1<|N(o)| =] lose)! [] los@)1, 
j=l g=ritl 


and |o;(a)| < 1/2 for j = 2....,71 + ro. Therefore, |o,(a)| > 1, ie. o1(a) F 
o;(a) for 7 € 1. By the Exercise above, dega = n and K = Q(a). Thus, we 
have shown that each number field K C C with [kK : Q] = n and dx = d 
is generated over Q by an algebraic integer a of degree n, such that all the 
conjugates of a are bounded by a constant c = c(d,n). By Proposition 5.24, 
there are only finitely many such a in C. 


6 
Analytic Methods 


In this chapter, we shall prove Dedekind’s famous formula of 1877 for the class 
number hx of a number field K, namely 


im, (s —1)¢x(s) =hk-k, (6.1) 


where ¢x(s) is the Dedekind zeta function 


CK (s) = Na) (6.2) 


The summation is over all nonzero integral ideals a of Ox. We shall show 
that the series on the right of (6.2) converges absolutely for real s in the open 
interval 1 < s < co. The constant « depends only on K and can be computed 
explicitly. In fact, most of this chapter is devoted to the computation of k. 


Theorem 6.1. (Euler Product Formula) For s > 1, 


ct =TI(1- x5) 


where the product is over all the nonzero prime ideals p of K. 


Proof. We only sketch the proof, leaving the details to be filled in by the 
reader. 


First, because N(p)* > 1, 


(: 1 ‘a oe eee ee 
N(p)> » N(p)e © N(p)s 

We formally multiply these series, one for each prime p, to obtain 
ie eee 1 

1 SS eee = Bape Od =a degre 

I ( nor) LO NGP Py 


In the summation, each product pf! ... pg’ occurs exactly once. Therefore, by 
Dedekind’s unique factorization theorem for ideals, 


1 1 
NG = Ne 
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the summation being over all nonzero integral ideals of kK. Hence, the Euler 
product formula follows. 


6.1 Preliminaries 


In Chapter 5, we proved that the group Of of units (of the ring of integers) 
of a number field K is isomorphic to Wx x Z". Here Wx is the group of 
roots of unity in K and r = ry + rg —1. Recall that r; (resp. rg) is the 
number of real (resp. pairs of complex) Q-isomorphisms of K into C, so that 
[Kk : Q) =1r1+2re. Let wi,...,u, be a fundamental system of units in A, that 
is to say, any u in O% can be uniquely expressed as 

U= nut ...ue, 
with 7 in Wx and aj,...,a, in Z. We now use the set {u1,...,u,} of funda- 
mental units in K to define an important invariant of K, called its regulator, 
which is intimately related to its class number hr. 


The hyperplane 


V= {v = (Aigetny Andre) E Ret ?s |) 7 fee + Aritre = 0} 


is a r-dimensional subspace of R™*"2. Let o1,..., Ory) Ory4ds Ory dls: +9 Orytres 
Or,+r. be all the Q-isomorphisms of K into C. In Chapter 5, we defined a map 
A: KX + R42 by A(a) = (log |oi(a)|, ..., log |a,, (a)|, log |or,+41(@)|?,--., 


log |o,;+r.(a@)|*), which is a group homomorphism from the multiplicative 
group K™ into the additive group R"+". We proved that A(O%) is a full 
lattice in the r-dimensional subspace V of R™*", defined above. To define 
the regulator, we need to compute the r-dimensional volume pu(A(O;)) of a 
fundamental parallelepiped of \(O;.). 


6.2 The Regulator of a Number Field 


We now state and prove a theorem that leads to the definition of regulator. 
For u in OF, let A;(u) denote the j-th component of the vector A(u) of R"™*". 


Theorem 6.2. The r-dimensional volume u(A(Ox)) of any fundamental par- 
allelepiped of the (full) lattice A(O;) in V is given by 


uA(OR)) = Vri +72 | det(oi(uj))I, 
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where {$1,...,@r} is an arbitrarily chosen subset of {A1,...,Ar,4r.} of car- 
dinality r. 


In particular the quantity 
Rx = | det(d;(u;))| 
depends only on K, and not on the choice of ¢1,...,r. 


Definition 6.3. The regulator of a number field K is the absolute value 
Ri = | det (9; (u;))| 
of the r x r determinant det(¢;(u,;)). 


Proof. Consider the unit vector 


1 
Se nee Game 
ae a 
in R"+"2, By the definition of V, the inner product (w, x2) = 0 for all x in V. 
Hence u 1 V (w is perpendicular to V). If 


L = (OX) ® Zu, 


then L is a full lattice in R™!*"? and the r-dimensional volume p(A(O;x)) of 
A(O;) is equal to the (r1 +r2)-dimensional volume of L, which is the absolute 
value of the (r; + r2) x (r1 + rg) determinant 


a 1 
1 A (u1) ae Aritre (u1) 
vritto |: 
Ai (ur) Aritre (ur) 
For all u in Of, we have 
j=l 


Hence, if for a given m (1 <m< 11 +19), all other columns are added to the 
m-th column, and the resulting determinant is expanded by the m-th column, 
the above determinant is easily seen to be equal to 


yo (ry +2) » det(;(uy)) = EV ra det (o;(ti,)). 


Since, up to a sign, the determinant does not depend on the choice of m, we 
are done. 
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6.3 Fundamental Domains 


In order to compute the constant « in the class number formula (6.1), we also 
need to study the so-called fundamental domain of Kk. Once again, recall our 
notation: 


1. K C Cis a number field, 
2. [kK : Q)=n=171 + 2ro, 
3. r=m+Tr2-1 
4. Wk ={n € K|n™ =1 for some m in N}. 
We put w = |Wx|. We also choose a set u1,...,u, of fundamental units of 


K, 
Then, the set {A(u1),...,A(ur)} is a basis, over R, of the r-dimensional 
subspace V C R™+"2 given by 
Ay tess +Arytre = 0. 


The vector u = (1,...,1;2,...,2) ¢ V. Hence, any vector v in R"*"? has a 


ryls; rg2s 
unique representation 


v = a,X(u1) ++++ + a,A(u,) + au, 


with a,a,; in R. 
As before, let | be the homomorphism from the multiplicative group £L = 
(R*)" x (C*)" to the additive group R™*", given by 


UGijt ow sr Heise ayes) S (log |a1|,..., log |a,, |; log |z1|?,..., log |z,,|?). 
(6.3) 


Definition 6.4. A set D is called a fundamental domain for Kk if D consists 
of the vectors a in £, such that 

1. l(@) = ayA(u1) +--+ + a,A(u,) + au with 

0 <a; <1 Gj =1,...,7r), 

2. 0 < Arg(wx(1)) < 2. 


[Here (1) stands for the first coordinate of x. If r; > 0, then w(1) € R and 
the condition (2) means that x(1) > 0. | 
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Example 6.5. Let us determine the fundamental domain D of a real 
quadratic fied K = Q(Vd), d > 1 being a square-free integer. We have 
[kK : Qo=n=2, 7 = 2,r2 = 0,sor=r4+rg—1= 1. If uw is a fun- 
damental unit in K, then so are +u;,+1/u;. Among them, there is one and 
only one which is larger than 1. Denote it by €. We call € the fundamental unit 
of K. 


The first basis vector \(u1) in equation (6.3) is A(e) = (log |o1(€)|, log |a2(e)|) = 
(log e, — loge). This is so because 


log |o1(e)| + log |o2(e€)| = 0. 


Since rp = 0, wu = (1,1). Let (x,y) € R™+7? = R?. Then equation (6.3) 
becomes 


I(x, y) = (log |x|, log yl) = aiA(€) + aw = (a; loge +.a,—ay loge +a). (6.4) 


The fundamental domain D is determined, in this case, by the conditions 


1. cy £0, 
2.0<a; <1, and 


3. 2>0. 
But from (6.4), we have 
log |x| = a1 loge +a, 
log |y| = —a; loge +a. 


Subtracting, we get 
log al = log 2”, 
ly| 

Le. 

Ll =e) or ly) =e Nal 

ly| 
This together with conditions (1) and (3) above imply that D consists of two 
components, one in the first quadrant lying between the straight lines y = x 
and y = 42, and a similar one in the fourth quadrant. See Figure 6.1, which 


suggests the following. 
Definition 6.6. A subset D of a real vector space is a cone if it contains a 
nonzero vector, and ca € D whenever x € D, c € Rt (the set of positive 


reals). 


Theorem 6.7. A fundamental domain D for a number field is a cone. 
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FIGURE 6.1: Fundamental domain. 


Proof. Clearly, the nonzero vector (1,...,1) € D. Next we show that if 
conditions (1) and (2) defining D hold for x, then they also hold for ca, given 
any real number c > 0. 

(1) Let w = (%1,...,%r,,21,---,2ra) € D, so that 

U(x) = aj A(u1) +--+ + G-A(uy) + au 
with 0<a;<1,j=1,...,r. 
If c > 0, then 
I(ca) = (log |cr,|,..., log |cx,,|, log |cz1|?, .. . , log |ez,.|”) 
(a) + log(c)u 
= a,A(u1) +--+ + a,A(u,) + (a + log c)u. 


Hence, ca also satisfies (1). 


(2) is obvious, because for a real number c > 0 and a complex number z, 
Arg (z) = Arg (cz). 


The reason for calling D a fundamental domain for K is that its elements 
form a complete set of coset representatives for the quotient group £L/p(Ox). 
Recall that D C £ = (R*)"™ x (C*)"™ CR". 


Fundamental Domains 79 


Theorem 6.8. Let D be a domain for a number field K. Any y in £L can be 
written, uniquely, as 

y = p(u)-# (6.5) 
with u in OF and «x in D. 


[The dot in (6.5) is the component-wise multiplication in L.] 


Proof. Let 7 be the generator of Wx C K given by 
” = cos | — } +7zsin | — }. 
w w 


I(y) = by A(ur) aT by A(ur) a bu, 


We can certainly write 


with 6, b; in R. Let a; be the fractional part of b;, i.e. 
aj = b; = [b5] with 0< aj< Le (6.6) 


Also choose m in Z such that 


2 2 
O< Arg (y(1)) — 2 < (6.7) 
w w 
If we take 
wa yall 
then 
g=p(u')-yeD. 
In fact, 


Wa) = ayXA(uy) + -++ + apA(ur) + bu 
with a; as in (6.6), and by (6.7), 


2rm 


Arg (a@(1)) = Arg (o(7™)y(1)) = Arg (vay € [0,2n/t)). 


rear 
To prove the uniqueness in (6.5), it is enough to show that if 
plu): x = plv)-y, (6.8) 


with u,v in OF and a, y in D, then u=v andaw=y. 


In equation (6.3), if v = p(v) with v in OF, then all a; € Z. Thus applying 
the map / to both sides of (6.8), and noting that two real numbers are equal 
if and only if their integer parts, as well as fractional parts, are equal, we get 


l(a) = I(y) and thus A(w) = A(v). 
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[Recall that the map \ = lo p.] 
Now A(u) = A(v) implies that v = wu for w in Wx. Hence by (6.8), 


w = plw)y, 
which implies that 
Arg (2(1)) = Arg (y(1)) + Arg (w). (6.9) 
Since ; 
O< Arg (w(1)), Arg (y(1)) < = 


and Arg (w) is a multiple of 24, equation (6.9) is possible only if Arg (w) = 0, 
which implies that w = 1. Hence, u = v, which also gives x = y. 


Definition 6.9. Let a, 3 € Of. We say that a and £ are associates if 8 = ua 
with some u in Op. 
Being associate is an equivalence relation, and hence partitions the nonzero 


elements of Ox into the set € of equivalence classes. Let D be a fundamental 
domain for K. Theorem 6.8 defines an injective map 


f:€9~D 


as follows. For a £ 0 in Ox, write (uniquely) p(@) = p(u)- x with # in D 
and u in Of. Put f(a) = a. Clearly if a, 6 are associates, then f(a) = f(8). 
Therefore, for C in €, we put 


t(C) = fla) =« (6.10) 
for any a in C. 
Recall defining the norm function N : £ —+ R. For « = 
(Dai ayaa Biya aie Me IN GE ay <i ay A e e 


Theorem 6.10. Suppose D C L is a fundamental domain for K. The re- 
stricted fundamental domain, that is, the set 


X=Xp={xeD|N(ax) <1} 
is measurable with measure 


w(X) = 27 rR /w. (6.11) 


We break the proof into simpler parts. 
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Lemma 6.11. Let c = (C1,...,Cr,,$1,--+;$ro) EL. If S CL is a (Lebesgue) 
measurable set, then so isc: S' and 


ule: S) =|N(e)|H(S). 
In particular, if |N(c)| =1, the measure p is invariant under the map 


Laaerec vel. 


Proof. If S is a measurable subset of R and c € R, then pu(cS) = |c|u(S). 
Now let s = poe’ € C. Under the multiplication by s, i.e. 


x= pe” > ppl +), 


du(z) = pdpdd > ppod(ppo)d( + 4) 
= popdpdd 
= |s|? - du(z). 
The Lebesgue measure on £ = (R*)" x (C*)"2 is the product of the Lebesgue 
measures on its factors. Therefore, if S C £ is measurable, then 
u(e- 8) = Jer|-+ ler, |Is1|? ++ [Sra [?H(S) 
= |N(c)| u(S). 


Lemma 6.12. The restricted fundamental domain 
X={eeD|N(x) <1} 


is bounded. 


Proof. First take w@ = (%1,...,%r,,21,---,2r,) € X = Xp with N(x) = 1. 
Write 
W(x) = a, A(u1) +--+ + arA(u,) + au (6.12) 
with 
0 <a; el Sal ary 
Since for a unit u € Of, pi o;(u) = 0, adding the coordinates on each side 
of the vector equation (6.12), we get 


log |N(a)| = log |xi| +--+ + log |xp,| + log |z1|? +--+ + log |2r,|? 


= S-aj(log |o1(ui)| + +++ + log Jor, (us)| + log [ory 41 (us)? ++ 
i=1 
+ log |r. tro(ui)|?) +a(L +++ +14+24---42) 


=an. 
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Hence 
_ log |N(@) 


n 


(6.13) 


Since |N(a)| = 1, we get a = 0. Therefore, the subset T = {a € X||N(ax)| = 
1} of X is bounded, because it consists of # in X with 


ia) = ayA(u1) +--+ + a,A(ur), 


where 
0< aj< ab 


The set X can be characterized by 


X={cex|xa eT, 0<cK< I}. 


Hence X is also bounded. 


Lemma 6.13. Let Y be the set {y © L|N(y) < 1, Uy) = aiA(u1) +--+ + 
arA(ur) +au with 0 <a; <1}, with no restriction on Arg (y(1)). Then 


Proof. Let 
QI. Benin QTE 
7 = cos — +7s1n —, 
Ww w 


so that 7 is a generator of the group Wx of roots of unity in K. For each j 
(0 <j < w), we put 


Xj = pln?) +X. 
If x is in X, then 
1. |N(o(n’) + )| = |N(0(7’))||N(x)| = |N(@)I, 
2. Up(m’) +x) = U(o(7’)) + Ux) = l(a), 
3. Arg (o1(7’)x(1)) = Arg (@(1)) + 3 9. 


Therefore, Y is a disjoint union of X1,...,Xw (Xw = Xo). By Lemma 6.11, 
p(X,;) = w(X), for all 7 =1,...,w. This proves that u(Y) = wy(X). 


Proposition 6.14. The set 
MT S4 Gis Pati ee re Y lex 0 V9} 


is measurable and 


w(Y) = 2" n(¥7). 
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Proof. The 2" vectors s = (+1,...,41; 1,...,1  ) partition Y into a 


ry components rg components 
disjoint union of 2" sets s- Y*, each of measure p(Y*). (Lemma 6.11, since 
|N(s)| = 1.) Therefore, 


MY) = 27 p(Y"). 


Corollary 6.15. 
_ 27 u(Y*) 


U(X) a 


Proof. Combine Proposition 6.14 and Lemma 6.13. 


Proposition 6.16. w(Yt) ="? Rr. 


Proof. Recall that Y* consists of the vectors @ = (%1,...,@r1,21;-++3 ra) 
in £, such that 


1. 2; >0,Vg7=1,...,75; 
2. N(a) <1 (Note that (1) implies |N(a)| = N(a).); 


3. if 
log |N (a 
I(x) = mA) +--+ 7A) + PBEM) ) u (6.14) 
then 0 <7; <1 for all j = 1,...,r1. 
Let 
ay =s;+v-lt,;, = 1,...,12. 
We change the variables 
Sj = Pj cos 95, 
t; = Pj sin 0; 
with p; > 0 and @, € [0, 27), so that 
ayy / dry ...dXp, Pr... Prodpr...Adpr,dA1 ... dp, 
yr 
i.e. 
u(Y*) = (2r)” / ws fo Lee Pry... Apr, dr, ...dLp,. (6.15) 
We use equation (6.14) to change variables 71,...,%,,;(1,---,Pr. (which 
satisfy 


1. 2; >0 for all j =1,...,171; 
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Dis Tee N07 8s Bea) Se 1) 
to T1,...,Tr,T = N(a) (satisfying 0 <7; <1, 7 =1,...,71; and0 <7 <1). 


To compute the Jacobian of this substitution, we rewrite equation (6.14) as 


a 
log 2; = DM 2 eee (6.16) 
and 
£ 2logT 
2log pi =D TyAi(uj) +=, (al eR Go (6.17) 
For i = 1,...,71, equation (6.16) gives 
Ox; 
Or; = x;r;(u;), oY ies Lee gt 
For i = 1,...,1r2, equation (6.17) gives 
Op; Pi . 
SS Ait) 7S aay Th 
OT; 9 (u;) J r 
Finally, 
Ox; XG 1 
= — = ‘ T 1. 
Or nt’ re 
and 
Opi ee 1 r 
Or nt’ ce ila 


Hence the Jacobian J is the absolute value of the determinant 


Az (uz) ae LA (Uy) xy /nr 

Lp, Avy (U1) cio Lr Ary (Ur) Lp, (NT 

Pir 4i(tr)/2 +++ pir 4i(tr)/2 pi /nr 

ProArytre(Ui)/2 +++ ProAr+re(Ur)/2 Pro /nT 
A1(u1) suey A1 (ur) 1 
= T1°°* Lr, Pl * Pre Ar, (ur) pe Ar, (Ur) 1 
N27 Ariti(ti) +++ Ar ti(Ur) 2 
Arytrg(ti) +++ Artre(Ur) 2 


If we add other rows to the first row, and then note that 
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1. SoU" dj(u) = 0, Vu € OX, and 


j=l 
2. T= E10 he Pt + Poy 
we obtain # 
J=——* _. 
272041 eee Pro 
Hence, (6.15) becomes 


iY SSO Jo 6+ ppg J+ dpy +++ dpr, dx, +++ d&,, 
yr 
1 1 
=n"Rx f af dp, - +: py, dx, +++ dx,, 
0 0 


= nT? Rr. 


Proof of Theorem 6.10. By Corollary 6.15 and Proposition 6.16, 
271 u(¥ *) 

w 
2 Re 


W 


W(X) = 


6.4 Zeta Functions 


We now return to the Dedekind class number formula. 


6.4.1 The Riemann Zeta Function 


The most famous zeta function is the Riemann zeta function ¢(s) defined for 
s=o+it in C with o > 1 by 


j= (6.18) 


However, throughout this chapter, we shall assume that t = 0, that is s € R. 


Theorem 6.17. The series for ¢(s) in (6.18) converges for s >1 and 


lim (s — 1)¢(s) = 1. (6.19) 


sa1+ 
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Proof. Let s > 1. For x € (1,00), + is a decreasing function. Hence, 


% os 


3 
3 
iM 


Therefore, for N > 2, 


which gives 


i.e. 


Multiply this inequality throughout by s—1 and let s > 1+, to obtain (6.19). 


6.4.2 A Partial Zeta Function 


Suppose D is a fundamental domain for a number field kK with degree 
[Kk :Q) =n=7r1 + 2re. Let 


X =Xp = {x € DIN(«) <1} 


be the restricted fundamental domain. For t in R, N(ta) = t”N(a). Let L 
be a lattice in R™ x C™ = R”. For a real number s, define the partial zeta 
function Z(s) = Z(L, D,s) by 


Zs)= So —— (6.20) 


N(a)|s 
«xELND 
Clearly, Z(s) depends on L and D. 


Theorem 6.18. The series for Z(s) on the right of (6.20) converges for s > 1 
and 


lim, (5 — 1) 2(s) = w(X)/p(L). 
sol1+ 
Proof. For t€ R, t > 0 and S C R”, let 


tS = {tv |a € S}. 
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Since L is discrete and X is bounded, the number 
1 
v(t) = |tx nN L| = xr 72 
of points common to both tX and L is finite. Moreover, if A = y(L), then 


= _ v(t) 
v= p(X) = jim, A— re (6.21) 
This is so, because +L provides a disjoint cover of X by v(t) parallelepipeds, 
each of measure A/t”, for an upper Riemann sum to approximate u(X) from 
above. As t goes to infinity, we get u(X). 


Since L C R™ x C™ = R” is discrete and the norm on L is a non-constant 
continuous function, there are only finitely many points in L of bounded norm. 
Hence, we can arrange the points of LM D in a sequence {a,,}, such that 


1 < |N(a1)| < |N(@2)| < |N(@s)| <--* 


Let 
tm =|N(am)|1/”. 
Because 
V(tm) = |L A tmX| 
={y€ LND||N)| < tm}; 
we have 


V(tm) =m. (6.22) 
On the other hand, for any € > 0, x; ¢ (tm — €)X, if 7 > m. Hence, 
U(tm —€) <m < V(tm). (6.23) 


Because t?, = |N(a@,)|, we obtain 


U(tm — €) m U(tm) 
 <(N@eal = ca 


Finally, (6.21) and (6.24) imply that 
m v 
lim ——— =—. 2 
mite [Nem A ee 
The equality (6.25) can now be ee to compare the series in (6.18) and (6.20), 
ie. to say that for s > 1, 3°°°_, +, converges if and only if 


1 
2a wey - Near 


does. 
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Again, if s > 1 and € > 0, by (6.25), for all sufficiently large m, say m > mo 
for some mg, we have 


(= c) < Weal <qte 


This gives 
(4 iy 1 1 e € 1 
A °) ms ~ \N(ame SA") ma 
Hence, 
v ss 1 = 1 v a | 
—— — —_— — —. 2 
Ca ee N@a)l> ~ (at) Do ae (626) 
m=mMo m=mot1 m=mot1 
But mo 1 mo 4 
alim, (s » >» |N(am)|$ _ Ba 7” 1) = ms =e O20) 


Therefore, multiplying (6.26) throughout by s—1, taking the limit as s > 1+4, 


and adding (6.27) to the limit, by Theorem 6.17, we get 
x -—e< im, Z(s) eS ate 


Since € > 0 can be arbitrarily small, this proves Theorem 6.18. 


6.4.3. The Dedekind Zeta Function 


Let K be a number field of degree [K : Q] = n = 171 + 2rg. Recall that for 
s=o+it in C, o >1, the Dedekind zeta function ¢x(s) of K is defined by 


Cx(s)= > Way’ (6.28) 


where the summation is over all nonzero integral ideals a of Ox. In particular, 
if kK = Q, all the integral ideals a are of the form a = nZ for n in N, and 
N(a) =n. Hence the Dedekind zeta function 
<a 
m=1 mé® 


is just the Riemann zeta fuction ¢(s). 


Let h = hx be the class number of K and {C),...,C,} be its ideal class 
group. We write (6.28) as 


h 
Ce (s) = pe Co; (8), 


where 


the summation being over all integral ideals b in C;. 


We will restrict s to be in R, and show that 
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1. each Cc, (s) converges for s > 1 and 
2. lims_.14(s — 1)¢c,(s) exists, and is independent of j = 1,...,h. 
Let C be one of C;,...,C,, and choose an integral ideal a in C~!, so that 
for every integral ideal b in C, ab = (a) = aOx. Since this means that 
a € a, we get a bijective map 6 > (a), @ in a, mapping integral ideals b 


in C' to the principal ideals generated by nonzero elements of a. Moreover, 
|Nx/o(@)| = N(ab) = N(a)N(6). Thus we have 


N(6) = |Nxo(@)|/N(a), 
and 


. 1 
éa(s) = N(a) s Nola 


Now (qa) = (@) if and only if a and £ are associates = |N(a)| = |N(8)|. (We 
write Nx /g(a) simply as N(q).) Therefore 


1 1 
s Dar > eF 


aea 


the last sum being over all the pairwise non-associate elements a # 0 of a. 


Let D be a fundamental domain of K. Equation (6.10) defines a bijective 
map € 5 a-— f(a) = a € D, from the set of equivalence classes of non- 
associate elements of Ox to D. Therefore, because N(a) = N(f(a)), 


1 1 
2 Iver = 24, Maye = 9 


xELND 
aea 


with the lattice L = p(a). This shows that the series for ¢¢(s) converges for 
s > 1. Further, by Theorem 6.18, 


lim, (s —1)éc(s) = lim N(a)*(s — 1)Z(L, D, s) 


so1+ 
— 3 H(X) 
=e ay 


27 R 1 
lim N(a)?--"—“"** . 
s1+ v0) 2-72 \/|dx|N(a) 
Qritre q?2 Rie 


wy |dx| 


=RK, 
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proving that this limit « is independent of the ideal class C. Hence, summing 
over all C, 
im, (s —1)¢x«(s) =K- hr. 


This is the Dedekind class number formula we set out to prove. 


Theorem 6.19 (Dedekind). We have 


Qritre 772 Rig 
lim (s—1 s) = —_—_ - hx. 
lim, (s — 1)¢x(s) IEE K 


EXERCISE 
Specialize Theorem 6.19 when K is a quadratic field. 
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Arithmetic in Galois Extensions 


In 1894, Hilbert developed decomposition theory to determine the factoriza- 
tion of a prime ideal p of & in K for Galois extensions K/k in terms of the 
Galois group Gal(K/k). 


An extension K/k of number fields is a Galois extension if it is a normal 
extension, i.e. for every k-isomorphism o : K — C, we have o(K) C K. [A 
ring homomorphism o : K > K is a k-isomorphism if its restriction to k is the 
identity map.] Some important examples of Galois extensions are quadratic 
extensions, and the cyclotomic extensions Q(¢), where ¢ is a root of unity. 
Throughout this chapter, we assume that K/k is a Galois extension of degree 
n. The set G = Gal(K/k) of n Q-isomorphisms ¢),...,0, of K into K isa 
group under the composition of maps. We call Gal(K/k) the Galois group of 
EK over k. 


Again let O = Ox, 0 = O, and write for a prime p of k, 
pO = Py Boe. (7.1) 
Ifo € G and § is a prime in K, then so is o(8). Since (pO) = pO, from 
(7.1) we get 
pO = o(P1)* --- o (Py). (7.2) 
By the uniqueness of factorization, it follows from (7.1) and (7.2) that given 


i, o(B;) =; for some 7. Conversely, we prove that given 7, Bj; = o(P;) for 
some 2. 


Theorem 7.1. [f 8 and Q are two prime ideals of K dividing a prime p in 
k, then Q = 0(§8) for some o in the Galois group G = Gal(K/k). 


Proof. Suppose not, i.e. ($8) 4 Q for all o in G. By the Chinese Remainder 
Theorem, we may choose a in O, such that 


a=0 (mod Q) 
and 
a=1 (mod o($)) 


for all o in G. Ifo, = 1x, then Nx/p(@) = a-02(a)---on(a) € QN0 = p. But 
a = 1 (mod o()) implies that (a) ¢ Q, for all o in G. Hence Nx /_(a) = 
Ioeg (a) ¢ QD p. This is a contradiction. 
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Corollary 7.2. Suppose K/k is a Galois extension of degree n, p a prime of 
k and 


pO = PP --- Per. 


Then €1 =...=€, =e, say, and f; =...=f, =f, say. Hence n = efg. 


Proof. Recall Definition 4.5 of f. Given 1,7 (1 <i < j < g), the primes 
B;, Bj in the factorization of (7.1) of p are interchanged by some o in G, but 
not e;, e;. By the uniqueness of factorization, it follows that then e; = e;. If 
o($8;) = B,;, then o induces an isomorphism O/P; = O/,;. This shows that 
fi = f;. Therefore n =e, f, +---+egfy = ef. 


7.1 Hilbert Theory 


Let K/k be a Galois extension of number fields with G = Gal(K/k). To study 
the factorization (7.1), it is enough, by Corollary 7.2, to fix a prime p of k and 
prime ‘8 of K dividing p. 


Definition 7.3. The decomposition group Z of %8/p is defined to be 
Z = Zyjp = {0 € G| o(P) = Ph. 
The inertia group T = Thx /p of B/p is the set 


T={0€Glo(a)=a (mod $),Va € O}. 


It is easy to see that Z and T are subgroups of G. Moreover, for a in 5B, the 
condition o(a) = a (mod $8) becomes o(a) = 0 (mod $8). Hence o( 8) = P. 
Therefore, T is a subgroup of Z. The letter Z and T are traditional (for their 
German equivalents, Zerlegung for decomposition and Tragheit for inertia). 


For every o in G, o(O) = O. Hence, if further o € Z, i.e. o (8) = B, then 
we have the obvious map 


a: O/P > O/P 
taking the coset a+ to the coset o(a) +. Because a), = lo, it is clear that 
lo/p = 1o/p. Hence if we put K = O/# and k = o/p, then & € Gal(K/k). 
[Recall that any extension of finite fields is Galois.] Thus, we have proved the 
following fact. 


Theorem 7.4. The map 


Z>0>7€ Gal(K/k) 
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is a group homomorphism with kernel T. Hence T is a normal subgroup of Z 
and Z/T is isomorphic to a subgroup of Gal(K/k). In particular, the order 
|Z/T| divides the order f =| Gal K/k|. 

For a subgroup H of G and a subset X of K, let X™ be the subset of X 
defined by 

X" —{r € X |o(x) = 2 for all o in H}. 

Thus K¢ =k, Kt = K, OF =o and for a prime ¥ in K dividing a prime p 
in k, BS = p. In general, K” is an intermediate field between k and K, called 
the fired field of H. It is easy to see that Ogu = OF and P4¥ = PN O". 
Moreover, the residue field O” /38" is an intermediate field between 0/p and 


O/B. 


Definition 7.5. The fixed field K% of the decomposition group Z = Z33/p 18 
called the decomposition field of 8/p. The fixed field K* of T is the inertia 


field of 8/p. 


Since T is a subgroup of Z, K% is a subfield of K’. Also K/k is Galois, 
hence 


pO = (Bi-+-By)° (7.3) 
with f; =--- = f, = f, say. So we have [K : k] =n, 
n=efg. (7.4) 


A Hasse diagram is a diagram in which the inclusion between field extensions 
and their subsets is indicated by putting a subset at a lower level of the page 
and connecting them by a line and the line joining a field extension is labeled 
with the extension degree. For example, the following diagram describes a 
central theorem of Hilbert Theory. 


Theorem 7.6. If B is a prime of Kk dividing a prime p of k, we have the 
following Hasse diagram. 
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Further, 
1. [K: KT] =e, [K7 : K%] = f, [K7 sk] =g, 
2. (a) e(B2/p) = f(P2/p) = 1, 
(>) eB /B2) =1, FOBT /B2) = FOB/p) = F, 
(c) eB/B") = eB/p) =e, FB/PT) = 1. 


Proof. Without loss of generality, we may assume that 8 = B,. 


We first prove that [K?% : k] = g. By Galois theory, the degree of the 
extension [K% : k] = [G:: Z], the index of Z in G. Hence it is enough to 
show that there is a bijection between the g primes 5B1,..., 8, and the cosets 
of G/Z. Given o in G, let o(B) = B,. By definition, 7(8) = BP, for all + 
in Z. Hence p(B) = PB; for each pp in oZ, and thus we have a map from 
oZ to j or §8;. This map is surjective, because given Bj, there is a o in G 
such that o(%) = ;. Moreover, 018 = oof implies that op 'o2(P) = P, 
ie. oO € Z, which is so if and only if 0, Z = o2Z. This gives the required 
bijection. 

The fact that [K% : k] = g, just proved, shows that the extension K/K7 
is normal of degree ef with Galois group Z. By definition, Z fixes B = Pi. 
Hence $f is the only prime in K dividing ¥7. But e’ = e(B/P7) < e(P/p) =e 
and f’ = f(B/B%) < fOB/p) = f. Now by (7.3), ef: f'- 1 = ([K: K2] =ef. 
This can happen only if e = e’, f = f’ and this also forces e(B7/p) = 1 and 
f(B7/p) = 1. 


Now we show that f(%/$7) = 1. Let K = O/P and KT = OT /P™. By 
definition, f (8/7) =1< [K : KT] =1 Gal(K/K7) = {1}. 


If for a in O, & denotes a mod, then [K : K7] = 1 SV @ in O/, the 
polynomial (2 — @)™ is in (O7 /%")[z] for some m > 1, because then each 
o in Gal(K/KT) sends @ to a root of (a — @)™, which can only be @. So let 
aed. 


The polynomial 


6(x) = T] (@ - o(a)) 


o€T 


is in OT [a]. If é(x) denotes the polynomial over K obtained from ¢(2x) by re- 
ducing its coefficients mod f, then (a) € K7 [x]. By definition of T, Vo € T, 
a(a) =a (mod §), ie. c(a) = a, hence 6(x) = (x —@)™ with m = |T|, prov- 
ing that f(B/P") = 1. Since f = f(B/p) = FOB/B") FBT BZ) FBZ /p), 
it now follows that f (87/387) = f, which implies that [K? : K7] > f. But 
by equation (7.4), [K? : K4] = |Z/T| is a factor of f. Hence [K7 : K7] = f 
which implies that [K : K7] = e. The rest of the statements now follow easily 
from what we have already proved. 
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Corollary 7.7. Let K = O/8, k =0/p. Then Gal(K/k) & Z/T. 


EXERCISE 


Give counter-examples to show that Z and T need not be normal in G = 
Gal(K/k). 


Corollary 7.8. If Z = Zsg/p is normal in G = Gal(K/k), then p splits 
completely in K%. Further, if T is also normal in G, then each of the g prime 
divisors of p in K% stays inert in K™, and finally becomes the e-th power of 
a prime in Kk. 


Proof. If Z is normal in G, then K7/k is a normal extension. Hence for all 
primes in K% dividing p, the ramification indices and the residue class field 
degrees are equal to one, and therefore, there must be g primes in K% dividing 
p. Again, by hypothesis K” /K% is a normal extension, so all the residue class 
field degrees are equal to f, hence the ramification indices are all equal to one. 
The last statement is now obvious. 


Theorem 7.9. (1) K% is the unique field L, intermediate between k and K, 


such that if Br = BOL, then e(B_/p) = f(Bzr/p) = 1, and P is the only 
prime in Kk dividing Br. 


(2) K is the unique field L, intermediate between k and K, such that for 
Br = POL, e(PBxr/p) =1 and P ts totally ramified over Pr. 


Proof. (1) Suppose L is a field with this property. Let H = {o € Gal(K/k) | 
o\, = 11}. By Galois theory, L = K". Ifo € H, then oP, = Pr. Now P 
being the only prime in K dividing Br, it follows that oB =PBPsoaeZ=> 
HCZ=> K7 CL. If e(Pxr/p) = f(Br/p) = 1, then e(/Br) = e and 
f(®%/B_) = f > [K: L] > ef > [L:k] < g. This together with [K7 : k] =g 
and K% C L implies that L = K%. 


We leave the proof of this part as an exercise. 


7.2 Higher Ramification Groups 


Again we fix a normal extension K’/k of number fields, a prime p in k and a 
prime 8 in K dividing p. For each non-negative integer j, we define the j*” 
ramification group V; of B/p by 


V; = {o € Gal(K/k)| o(a) = amod P*1,V a € O}. 
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The letter V is from the German word Verweigung for ramification. 


Thus Vo = T and 
Vo 2 Vi 2 V2 2 


By the unique factorization theorem of Dedekind, if a € O, a 4 0, then for 
some j > 0, a € P, but a ¢ Pt". Hence, 


() B? = {0} 

j=l 
and o(a) = a (mod $8’) can hold for all 7 > 0 if and only if o = id. This 
shows that = 

(1; = {1- 

j=0 
Since G is finite, V,, = {1} for some m > 0. It is easy to see that each V; is 
normal in Z, because if a € Z, T € V;, then V a in O, 


T(o(a)) — o(a) € B*", 
which implies that (o~'ro)(a) — a is in $/*1. As a consequence, each V; is 


normal in V;_1 


Theorem 7.10. (1) T/V, is isomorphic to a subgroup of the multiplicative 


group KK, where K = O/9 is the residue field. Hence T/V, is cyclic and its 
order divides qf — 1. 


2) 7 > 2, Vj-1/V; is isomorphic to a subgroup of the additive group K. 
j j pie! 


Hence, its order is a prime power p" (r > 1), where p= char K. 


Proof (1) Let.8 =o \ pc If 0° = 3S *0;p' = S-'p, = S-pB).0" = S-10 
and a0 € G. Then o(a) =a ae B/), V a in O if and only if o(a’) = a’ 
(mod $7), V a’ € O'. Hence, if 


vy ={c€G|a(a’)=a’ (mod P”),V a’ € O'S, 


then V; = Vj. Thus localizing at p, we may assume that ‘P is principal. Choose 
a uniformizing parameter at 3B, i.e. an element 7 in $B \ ?. Then YP = (7). 


Ifo € Z, then of = YP, so o(7) = aga with a, in O. The unique a, is not 
in $B, otherwise we would have of = $87. Thus we have a map 


Z>20 > Gz € (O/B), (7.5) 


where G@, is the coset of a, in O/8. We claim that the restriction of this map 
to T is an isomorphism with kernel V|. To prove this, let 0,7 € T = Vo. Then 


Agrt = 0(T(7)) = o(a-m) = o(a,)o(m) = o(a,)aoT, 
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which gives 
Agr = O(4,)dg. 


Because o € T, o(a,) =a, (mod 8). Hence, 


Agr =474g (mod J). 


Thus the restriction of the map in (7.5) to T is a homomorphism. 


To prove that the kernel is Vi, we shall need the following characterization 
of the subgroups V;. 


Lemma 7.11. The groups V,,(m > 1) can be characterized as follows: 


Vn ={o €Glo(x)=a (mod P"*)}. 


Proof. We have only to show that 
o(n)=m (mod $”*1) (7.6) 


implies o(a) = a (mod ¥"*t!), V a € O. By Theorem 7.6, OF /? = O/P. 
Hence, we may choose coset representatives of 0/38 from O7. Also the mul- 
tiplication by 7™ gives an isomorphism of the additive groups 


Opener pe. 
Hence, mod $+", each element a of O has a representation 
= a9 + ain +++ +amn™ (aj € O7). 
If (7.6) holds, then for o in Vo = T, 


a(a@) = a(ao) + o(a1)o(m) +--+ + 0(am)o(n™) 
=d9 tayn +--+: +am7™ 
=a (mod p*"), 


This proves the lemma. 


To show that the kernel of the map o > @G, is V,, let o € Vo. By Lemma 
7.11,0 €V, & o(t) = 7 (mod $7) & aga = am (mod $7) & ag = 1 
(mod 8) = a is in the kernel. 


(2) Now let j > 2. If o € Vj-1, then o(7) = 7 (mod $8’). Hence 
a(n) —n=bot, (7.7) 


for a unique b, in O. 
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If 7 is another element of V;_1, then 
T(x) —17 =b,7 (b, € O), (7.8) 
and by (7.7) and (7.8), we have 
bot! =oT(n)— 1 
=o(b,n +n)—1 
= o(b-)(o(n)¥ + (o(m) = 2) 
= o(b,)(m + bom?) + bg. 
Dividing throughout by 77, we get 
bor = O(b,)(1 + bg 7171)! + bg. (7.9) 


Because j—1 > 1, o(b,) = b, (mod 9). Hence reducing equation (7.9) mod f, 
we get _ _ 7 
ber = bo + br. 


This gives a homomorphism from V;_; to O/. 


To show that the kernel of this map is V;, let 0 € Vj_1. By Lemma 7.11, 
a € V; © o(t) = 7 (mod Pt!) & o(m) — 7 = bot) € PIT! & b, = 0 
(mod 98) = a belongs to the kernel. 


The order of the additive group O/ is Nz /9(8) = p® for some s > 1. 
Hence |V;-1/V;| =p" withO<r<-s. 


Theorem 7.12. The map 
Z>3037€ Gal(K/k) 


1s a group homomorphism with kernel T. Hence, T is normal in Z, and the 
quotient group Z/T = Gal(K/k). 


Proof. It is obvious that o + @ is a group homomorphism for o7(a + 8) = 
oT(a) +8 =Go7T(a+). By definition, its kernel is 


{co €Z|c(a)=a (mod %),V ae O}, 


which is T. Thus the map o — @ induces an injective homomorphism 
Z/T — Gal(kK/k). But Z/T and Gal(k’/k) both have f elements. Hence 
Z/T = Gal(K/k). 
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7.3 The Frobenius Map 


The multiplicative group fk of all the nonzero elements of the residue field 
k=o/pisa cyclic group of order g— 1, where q is the cardinality of k. Hence 
for each a in k, a4 = a. Moreover, since (a + 8)? = a7 + 87 for all a, B in K, 
the map ®: K — Kk, given by 

O(x) = «4 


is a k-linear map. Thus, ® € Gal(K/k). If a is a generator of the cyclic 
group K °, which is of order g/ — 1, then a, ®(a), 62(a),..., ®f—1(a) are all 
distinct, because otherwise, ®/(a) = a for some j (0 < j < f). That means 
that a” =a a~! =1, ie. in the group K”, ord(a) = qf —-1<q@-1> 
j => f, a contradiction. Since Gal(K/k) = [K : k] = f, the distinct powers, 
1,0,6?,...,®/—1 are all the elements of Gal(K/k). Hence Gal(K/k) is a 
cyclic group generated by ®. 


Definition 7.13. The generator © of the cyclic group Gal(K/k), given by 
O(x) = x! 
is called the Frobenius map over k. 
Clearly ® depends only on the ground field k of the extension K /k. 
Definition 7.14. A Galois extension K’/k is a cyclic extension, an Abelian 


extension or a solvable extension according as the Galois group Gal(K/k) is 
cyclic, Abelian or solvable. 


Recall that a group G is solvable if there is a chain of subgroups 


G=Go 2G) D---D Gn = {1} 


such that Gj, is normal in G; and the quotient group G;/Gj+1 is Abelian. 
In particular, every Abelian group is solvable. 


Theorem 7.15. The decomposition group Z = Zsg/p is solvable. 


Proof. Consider the chain of subgroup 
Z2T=YW2Vi2-:-2Vm= {I}. 


By Theorems 7.10 and 7.12, the quotients Z/T and V;/Vj41 (j = 0) are all 
Abelian groups. 


We have seen that the order of the quotient group Vo/V; divides gf — 1 = 
(q—1)(qf-! +--»+q+4+1). More is true if K/k is Abelian. 
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Theorem 7.16. If K/k is Abelian, then |Vo/V\| divides q — 1. 


Proof. Choose 7 in T = Vo such that its coset in Vo/Vi generates the cyclic 
group Vo/Vi, and choose ¢ in Z such that its coset in Z/T = Gal(K/k) is 
mapped to the Frobenius map ®: K > K, given by ®(x) = x’. 

Let 7 be a uniformizing parameter at 8 and o(m) = a7, T(7) = a,7. To 
simplify notation, let us put ag = a and a, = b. Clearly, a ¢ 8, otherwise 
o(%) = P C P?, which is not true. By our hypothesis, G is Abelian. Hence 
br = T(r) = a(b)o(m) = of(n) = To(m) = Tat) = T(a)T(7), which gives 
a(b)am = T(a)br, ie. 

aa(b) = br(a). (7.10) 


By our choice of o and 7, this gives 
bfa=ab (mod ). 
Since a,b ¢ $B, this gives 


b1=1 (mod 9). 


Hence, ord(r) = ord(b) in (O/38)* is a factor of g— 1. 


We shall need two results, one of them from Galois theory. We shall only 
state them, leaving their proofs as exercises. Recall that the composition field 
Ky Ko of Ky and Ko is the smallest field containing kK, and Ko. 


Proposition 7.17. If k,/k, K2/k are Galois extensions with Galois groups 
Gi, Go and K = Kk, Ko, then K,K2/K is Galois. The Galois group 
Gal(K, K2/K) is isomorphic to the subgroup 


{(01,02) € G1 X G2 | o1K = 2K} 


of Gy x Ga. Thus if kK, Ko =k, then Gal( Kk, Ko/K) = G, x Gy. Moreover, 
if kK, /k and K2/k are Abelian, then so is Ki Ko/K. 


Proposition 7.18. Suppose K,, K2 are Galois extensions of k and kK = 
Ky K2. A prime p of k ramifies in K if and only if p ramifies in Ky or Ko. 

[Hint: Let % be a prime of K dividing p. Let B; = BO K;. Inject the group 
Ty /p into Typ, jp X Tpo/p:] 


We also need the following fact from group theory, whose proof is left as an 
exercise. 


Proposition 7.19. Suppose G is an Abelian group of order a prime power 


p™ (m>1). Then G is cyclic if and only if G has a unique subgroup of order 
m—1 
p : 
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7.4  Ramification in Cyclic Extensions 


A cyclic extension is Abelian but not conversely. An example of an Abelian 
extension, which is not cyclic is the following. Let K = Q(V/2, V3). Then K/Q 
is Galois with Galois group 


Gal(K/Q) = Z/2Z x Z/2Z, 


which is Abelian but not cyclic. 


In this section, all extensions are assumed to be Abelian. Thus Zg3/p, Ty3/p, 
e(%/p), f(98/p), etc. depend only on p, not on 8. For example, if $%, Q are 
two primes in K dividing p, then Q = 7% for 7 in G. Therefore o® = Ps 
TOP = TPE & of P = TP S oQ = Q, hence Zy/p = Zg/p. Thus, we may 
denote these objects simply by Zp, Tp, ep, fp, etc. 


We now study the ramification of a rational prime p in an Abelian extension 
K/Q of degree p™ (m > 1). It is therefore given that Z,, T, and the higher 
ramification groups 


V; = {o € Gal(K/Q)|o(a) = amod #*"}, 


where $$ is any prime in K dividing p, depend only on p. Since G = Gal(K/Q) 
is of order p™, all the subgroups of G' and their quotients have order p" (r < 
m). We have seen that the quotient Vo/V; is isomorphic to the multiplicative 
group K”, where K is the residue field O/ of cardinality p” for some r > 1. 
Since |Vo/Vi| is also a power of p, the order |Vo/Vi| can divide p” — 1 = |K™| 
(Theorem 7.10) only if it is one. Hence, we have proved the following result. 


Theorem 7.20. Suppose K/Q is Abelian of degree p™ (m > 1). If V; (7 = 0) 
are the ramification groups for the prime p of Q, then Vo = Vi. 


Recall that p ramifies in a finite extension L of Q if and only if pldz, the 
discriminant of LZ. On the other hand, by Minkowski’s Theorem on the dis- 
criminant, dy, = 1 if and only if L = Q. Hence, in every proper extension of 
Q, at least one prime must ramify. 


If PB is a prime of a Galois extension K of Q dividing p and T = Ty, then 
no prime over p ramifies in the fixed field K7 of T. Thus if p is the only prime 
to ramify in K, then by Theorem 7.6, KT = Q. 


By Theorem 7.20, we at once obtain the following. 


Theorem 7.21. If p is the only prime to ramify in a Galois extension K/Q, 
then p is totally ramified in K. Moreover, f(8/p) = 1 and O/® = F,, the 
field of p elements. 
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For j > 2, V;-1/V; is isomorphic to the additive group of the residue field 
K = O/% & F,. Hence the following is obvious. 


Theorem 7.22. If p is the only prime to ramify in an Abelian extension K/Q 
of degree p™ (m > 1), then for j > 2, the order |V;_1/V;| is 1 or p. 


Theorem 7.23. Suppose p > 2 is the only prime to ramify in an Abelian 
extension K of Q of degree p. Then the ramification group V2 (of p) is {1}. 


Proof. Let $8 be a prime of K dividing p. By localizing, we may assume that 
38 = (7) is a principal ideal. By Theorem 7.21, p is totally ramified in K, i.e. 
(p) = (m?). Since 7 € Ox and degg(m) > 1 and [K : Q| =p, 7 is a root of a 
monic polynomial 

f(x) =a9 +aya +--+ +a? 


over Z. 


Recall the discrete valuation 
v=uy: K* 92 


at the prime %, namely that vg(a) is the exponent (positive, negative or 

zero) of $B in the unique factorization of the principal (fractional) ideal (a) into 

products of powers of distinct primes. The map v has the following properties. 
v(aB) = v(a) + (8), and (7.11) 
if u(a) # v(8), then v(a+ 8) = min (v(a),v(8)). 

Of course, (7.11) extends for v(a1...a,) and v(a, +---+Q,) in an obvious 

way. 


The Galois group Gal(//Q) has order p, so it has no nontrivial subgroup. 
Since p is totally ramified, T = G and by Theorem 7.20, T = Vo = V,. Let 
j > 2 be the smallest integer such that V;_1 = G and V; = {1}. We shall 
show that 7 = 2. 


First note that 
v(f'(m)) = j(p — 1). (7.12) 

This is so because 
f= JE om) = JE Gol). (7.13) 


o€G,oFid o€V;-1\V; 


Since 
Vj-1 ={0 € Glo(a)=a_ (mod $’),V a € O}, 


it is clear that V o in Vj;_1 \ Vj, 


u(m —a(m)) =9. (7.14) 
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Therefore, (7.12) follows at once from (7.11), (7.13) and (7.14). 
Setting a; = jaj71~! we also have 
f' (mw) = a1 + 2agn +--+ + (p—1)@p-17?-? + pr? 
=A, +02 ++''+Qp, 


say. If we write a nonzero coefficient a = ja, of f’(m) as a = bp" with (b,p) = 1, 
then in view of p being totally ramified, i.e. u(p) = p, it follows that p|v(a). 
Hence 

v(a;) =ti—1 (mod p), 
in particular, vp(a;) are all unequal. Thus by (7.11), v(f’(7)) = min v(a;). 
This implies that 


v(Qp) = v(pr?—*) = 2p — 1 > o(f"(x)) = j(p — 1). 


This can hold for j > 2 only if j = 2. [This conclusion is not true, if p = 2.] 


Theorem 7.24. Suppose K/Q is an Abelian extension of degree p™ (m > 1). 
Ifp > 2, and p is the only prime to ramify in K, then K/Q is cyclic. 


Proof. By induction on m. If m = 1, there is nothing to prove. So suppose 
m > 1. By Proposition 7.19, it suffices to show that the only subgroup of 
G = Gal(K/Q) of order p™~! is Va. 


Let H be a subgroup of order p™~!. Let K’ = K# be the fixed field of 
Hf. Since the index [G : H] = p, by Galois theory, [K’ : Q] = p and the 
Galois group G" = Gal(K’/Q) = G/H. Let Vj be the ramification groups for 
p relative to the extension K’|Q. By Theorem 7.23, Vz = {1}. Since Vj is the 
image of V; in G/H, this implies that V2 C H. Since p is totally ramified, 
G = T = Vp. By Theorem 7.20, V; = Vo = G. By Theorem 7.22, |V,/Vo| = 1 
or p. Since V2 CH ZG=V\, this shows that H = Vo. 


7.5 The Artin Symbol 


Let K/k be a Galois extension (not necessarily Abelian) of number fields with 
Galois group G. Let ‘8 be a prime of K, dividing a fixed prime p of k. Recall 
the definition of the decomposition group Z = Zg3/p: 


Z={cEGloP=F}. 


If $81, Be are two prime divisors of p in K, then Bo = 7° for some 7 in G. 
For j = 1,2, let 2; = 2, /p- Then trivially, 


Zo = ro Zyr. 
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From now on, let p be unramified (© p { 0«,;,) so that the inertia group 
Ty /p = {1}, V P® in O dividing p. Then the canonical generators oy of the 
cyclic group Zg3/p under the isomorphism 


Zy3/p = Gal((O/B)/(0/p)) (7.15) 


for all 8/p are conjugates, and in fact, form a conjugacy class in G. We denote 
; K/k 


Definition 7.25. The Artin symbol is the conjugacy class 
Cr) 
Pp 
K/k 


Now assume that G is Abelian, so that (44) consists of a single element 


of G = Gal(K/k). 


o of G. It is characterized by the property: 
o(a) = a0) (mod 8),V a € Ox, 


where is any prime of K dividing p. In other words, for the Abelian extension 
K/k, the Artin symbol is the “pullback” under the isomorphism (7.15) of the 
Frobenius automorphism ®, of O/8 over o/p defined by 


®, (a) = 7Nesalr) 


for x in O/P. 
In the Abelian case, recall our notation Z), Ty, ep, fp, etc. We then have 
[K : k] = ep fp Gp. 
Since, by our assumption, ey = 1, [K : k] = fpgp. Therefore, p splits com- 
pletely in K © fp = 1S Gal((O/8)/(o/p)) = {1} © the Artin symbol 
x/ = 1, the identity element of Gal(K/k). Therefore, we have proved the 


following result. 


Theorem 7.26. Suppose K/k is an Abelian extension of number fields and 
p a prime of k, unramified in K. Then p splits completely in K if and only if 
the Artin symbol (54) =1; 

We now illustrate this theory with an application to the simplest nontrivial 
example. 
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7.6 Quadratic Fields 


Let d 4 0,1 be a square-free integer and K = Q(Vd). The quadratic extension 
K/Q is Abelian with Galois group 


G = Gal(K/Q) = {+l}. 


Let p be an odd prime with (p, d) = 1, so that p is unramified in K. The Artin 
K/k 
p 


symbol o = ( ) is the element of G characterized by 


a(a)=a? (mod %),V a€ Or, (7.16) 
8 being any prime divisor of p in kK. We know that 
On =Z06 Lu, 


where 
Vd if d=2,3 (mod 4) 
w= 
14Vd  ifd=1 (mod 4) 


Since o\g = id, (7.16) is, therefore, equivalent to 
o(w) =u? (mod §). 
Now 2 ¢ $8, hence it is a unit in O/8. So (7.16) is equivalent to 
o(va) = (Vd)? (mod ). 


Therefore, 0 =id © 


Vd = (Vd)? (mod ). (7.17) 
Because (p,d) = 1, Vd ¢ 8, hence after cancelling Vd, (7.17) is equivalent to 


p-1 
d= =1 (mod). 
Recall the Legendre symbol, which is a homomorphism 


7x x 
coo 


a po 1 ifaeF*? 
x(a) =(—] =a? = 
Pp —1 otherwise 


This shows that (£2) = 1< the Legendre symbol (4) = 1. But (#2) 


given by 


and (4) can only be +1. Hence, we have proved the following fact. 
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Theorem 7.27. For the quadratic extension K = Q(Vd), the Artin symbol 
(£2) is the same as the Legendre symbol (4). 


If pd, it is convenient to define the Legendre symbol (4) = 0. We have the 


following fact. 


Corollary 7.28. Suppose d # 0,1 is a square-free integer and K = Q(Vd). 
For an odd prime p, 


1. p is ramified in K = (4) =0, 
2. p splits in K & (4) =1 and 


3. p stays inert in kK & (4) Sk 


Remark 7.29. In the quadratic extension K = Q(Vd), p splits completely 
= p splits. 

The following was conjectured by Fermat in 1640 and proved by Euler in 
1754. 


Corollary 7.30. An odd prime p is a sum of two squares if and only if p= 1 
(mod 4). 


Proof. The prime p = x? + y? = (a + iy)(x — iy) if and only if p splits in 
Q(V=1) + (++) = (-1)°3* =16p=1 (mod 4). 


Pp 


7.7 The Artin Map 


Let 0 = 0x, be the relative discriminant of an Abelian extension K/k of 
number fields. Fix an integral ideal m, divisible by all the prime divisors in k 
of 0. Let 3(m) be the multiplicative group of fractional ideals 


a= pri: pr” (ay € Z) (7.18) 


with all (p;,m) = 1. We extend the Artin symbol (<4) to a group homo- 


morphism, 


KK: I(m) — Gal(K/k), 
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again called the Artin map, as follows. For a as in (7.18), put 


OK/k (a) = (=5) 


Com 


Class field theory is the study of the Artin map ax /;. This map carries a great 
deal of information about the arithmetic of the relative extension K/k. For 
example, we may restate Theorem 7.26 as follows. 


Theorem 7.31. Let K/k be an Abelian extension of number fields with 
relative discriminant 0. An unramified prime p of k splits completely in 
K = pe Ker(ax;,), where axy;, is the Artin map 


Kp: 3(0) + Gal(K/k). 


We now ask the following question. Suppose K/k is an Abelian extension 
of number fields and p is an unramified prime of k, so that [K : k] = fpgp. 
In particular, g = gp is a factor of [K : k]. Conversely, given a factor g of 
[K : k], is there a prime p in k, such that g = g,? The answer is, in general 
no, because of the following fact. 


Theorem 7.32. Suppose K/k is an Abelian extension, such that there is a 
prime p ink which stays prime in K, then K/k is cyclic. 


Proof. If p stays prime in K, ice. if p is inert, then fy = [K : k]. Therefore, 
Gal(K/k) has order f = [K : k]. But there is a surjective map from Gal(K/k) 
to the cyclic group Gal(K/k) of order f = [K : k], which is also the order 
of Gal(k/k). Hence this map is also injective, and therefore, Gal(K/k) is 
isomorphic to the cyclic group Gal(K/k). 


Example 7.33. No prime stays prime in K = Q(V/2, V3), because K/Q is 
Abelian but not cyclic. Thus the answer to the above question with g = 1 is 
no. 


EXERCISES 


1. Prove Proposition 7.17. 
2. Prove Proposition 7.18. 
3. Prove Proposition 7.19. 


Taylor & Francis 
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Cyclotomic Fields 


Beyond quadratic extensions, the cyclotomic extensions are the simplest and 
best understood ones. What makes them even more important is the fact 
that any Abelian extension is a subextension of a cyclotomic extension. In the 
next chapter, we shall give a proof of this celebrated theorem. It was stated by 
Kronecker in 1853 and proved by Weber in 1887. Actually, the first complete 
proof is due to Hilbert [24]. In 1911, after some unsuccessful attempts, Weber 
did eventually succeed in providing a correct proof of his own. Several other 
proofs of this important theorem have appeared in the literature, some only 
a few years ago. 


8.1 Cyclotomic Fields 


Fix an integer m > 1. The m-th roots of unity are the roots in C, of the 
polynomial z”” — 1. They form a cyclic group f1y, of order m, generated by, 


6 =m = e27t/™ — cog (=) +i7sin (=) : (8.1) 
m m 


The primitive m-th roots of unity are the generators of the cyclic group [m. 
Thus 77 is a primitive m-th root of unity if and only if 


1) = Cy (a, m) _ 1. 
Let ¢(m) be the Euler totient function 
o(m) = #{a Ee Nl] 1 <a<m, gced(a,m) = 1}. 


We have seen in Chapter 5, that 
if 
d(m) =m II 1——}. 
Pp 
plm 
In particular, 


é(p) =p—1, o(p") = (p—1)p” + and ¢(2") = 2"7?. 
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In terms of the totient function, we may say that there are exactly ¢(m) 
primitive m-th roots of unity. We shall exclude the trivial case m = 1 and 2, 
and unless stated to the contrary, assume that m > 2. The symbol ¢, will 
stand for cos (7) + isin (27), whereas ¢ will be any m-th root of unity. 
Definition 8.1. The m-th cyclotomic field is the number field K = Q(¢;). 


The following statements are obvious. 


1. Ifn|m, then Q(C,) C Q(Cm), because ¢, = Cr/”. 


2. Let d= g.c.d. (m,n) and c = Lem. [m,n]. Then Q(¢m) A Q(Gn) = 


Q(¢a) and the composite field Q(¢m)Q(Gn) = Q(¢-). In particular, 
the intersection and composite of cyclotomic fields are cyclotomic 


fields. 


3. The group Um C Q(¢m). If o is a Q-isomorphism of kK = Q(¢,,) into 
C, then for ¢ in Um, (a(¢))” = o0(¢™) = o(1) = 1, hence o(¢) € K. 
Thus K is a normal extension of Q. 


We now record this trivial but important fact. 
Theorem 8.2. Q(¢n)/Q is a Galois extension for all m > 1. 


Definition 8.3. The minimal polynomial ®,,(x) of Gj, = cos (7=) +isin (22) 
is called the m-th cyclotomic polynomial. 


We shall show that all the primitive m-th roots of unity have ®,, (x) as their 
minimal polynomial, i.e. the primitive roots in (4, are the conjugates of Gn. 
We shall need the following calculation. 


Theorem 8.4. Suppose ¢ is any primitive m-th root of unity. Then 


I] @-@)=Cyr tn. (8.2) 
a 


Proof. All the roots of x” — 1 are ¢,¢?,...,¢'™ = 1. Hence 


oe” -1= |] (#-¢). (8.3) 


j=l 


Equating the constant terms in (8.3), we get 


("7 = [[¢, (8.4) 
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whereas, on differentiating (8.3) and evaluating the resulting equation at x = 
¢’ for i=1,...,m, we get 


m 


mS a) (8.5) 
it 


Now take the product for alli =1,...,m to get 
i. m-1 
m™ (iI “) = | Co). (8.6) 
i=l 


1<i,j<m 
iZj 


Substituting for []j”., ¢’ from (8.4) in (8.6), we obtain (8.2). 


Corollary 8.5. If ¢ = Gn, K = Q(¢), then the discriminant dx|m™. In 
particular, ifm =p" (r > 1) is a prime power, then |dx| = p® (s > 1). 


Proof. Being minimal, dx divides the discriminant of any basis of K/Q, 
consisting of elements of Ox. In particular, dx|A(1,¢,...,¢"~+), where n = 
[K : Q. But A(1,¢,...,¢"7') is the square of the van der Monde determinant 
det(o;(¢7)), where o1,..., 0 are all the elements of Gal(K/Q). Hence, 


A(1,6,-..,6"77) = [[(i(Q) — 950). 


iAj 


This is a subproduct of the left-hand side of (8.2), which shows that dx|m™. 


Theorem 8.6. Suppose ®,,(x) is the minimal polynomial of Gn over Q and 
K =Q(Gn). Then we have the following. 


1. ®,, (a) € Z[a], 
2.n(2)= |] (@-96). 


CEbn; 
C primitive 


In particular, deg ®,;,(x) = d(m). 
3. Gal(K/Q) & (Z/mZ)*. 


Proof. (1) is just the well known Gauss’ Lemma (see [23, p. 120]). 


We shall prove (2) and (3) simultaneously. Suppose o € G = Gal(K/Q). 
Then o is uniquely determined by the value o(¢;,). Moreover, € = o(Gm) is 
also a primitive m-th root of unity, for otherwise o~'(€) = Gm would not be 
a primitive root of unity, either. So o(¢m) = ¢%, 1 < a = a(a) < m with 
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(a,m) = 1. Further, it is easy to check that a(o1) = a(o2) @ 01 = o2. Thus, 
the map 
G>3oa-a(oc) € (Z/mZ)* (8.7) 


is injective. In particular, 


deg ®m(x) = | Gal(Q(¢)/Q)| < o(m). (8.8) 


This map 0 —> a(c) is easily seen to be a group homomorphism. If we show 
that ®,,,(a) vanishes at all the primitive m-th roots of unity, then we will have 


deg ®,,(x) > d(m), 


which together with (8.7) and (8.8) will prove everything. 


If ¢ and 7 are two primitive m-th root of 1, then 7 = ¢% with (a,m) = 1. 
Thus 7 may be obtained from ¢ by replacing it successively with ¢?, for each 
prime divisor p of a. Hence, it is enough to show that if a primitive root ¢ of 
1 is a zero of ®,,(x), then so is ¢?. 


So fix a primitive m-th root ¢ of 1 with ®,,(¢) = 0 and a prime p, not 
dividing m. First note by (1), i.e. by the lemma of Gauss, ®,,(x) is a monic 
polynomial in Z[z]. By the Multinomial Theorem, 


(®m(2))? — Om (a?) € pZ[z]. 
But ®,,(¢) = 0. Therefore ®,,,(¢?) is divisible in Z[¢] by p. Now write, 
®,,() = |] («-»), 
wel 

where J is a subset of fim. If ®m(¢?) #0, then [],,-;(¢? — w) is a subproduct 
of 

Li Cae): 

1<i,j<m 

tAj 
Hence, ®,,,(¢?)|m in Z(¢). But p|®,,(¢”) in Z[¢]. Hence p|m™ in Z[¢]. But this 
can happen only if p|m”™ in Z, which is a contradiction. 


Theorem 8.7. Suppose m = p” (p prime, r > 1), € = Gn and K = Q(¢). 
Then p is the only prime ramified in K. Further, p is totally ramified in kK, 
i.e. (p) = p%™ for a prime ideal p in K. The ideal p is principal, generated 
by1l—¢. 


Proof. We know that p is ramified in Kk if and only if pldx and by Corol- 
lary 8.5, dx = p® (s > 1). Hence p is the only ramified prime. 
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To show that p is totally ramified, first note that ¢ in um, for m = p’, is 
primitive if and only if ¢’/? 4 1. Hence, the primitive m-th roots of 1 are 
precisely the roots of the polynomial (a — 1)/(#/? — 1) in Z[z], ie 


a” —1 F 
emp — 1 = II (x _ Oy 
1l<j<m, 
(j,m)=1 


This, together with L’Hospital’s rule, gives 


I] @-@=1m [] @-¢)= tim 2 — =p, 


ein rl jen rol gm/P — 1 
(j,m)=1 (j,m)=1 
Le. 
p= |] a-¢). (8.9) 
l<j<gm 
(j,m)=1 
To compute the right hand side of (8.9), first note that 1 =14¢4---4 
(J-! € Ox. On the other hand Deca (j,m) = 1, i7 =1 (mod m) for some 
i. Therefore the field ene ae — = =f = =14¢€94+¢7 +...+¢0-Di is also 
= as 


in Ox. This shows that + ee ‘a 
to equation (8.9), 


p= JT a-e)= 0-0 TT 2 =wa-gam 


1<j<m 1<j<m 
(jm)=1 (jm)=1 


for u in OX. Therefore (p) = (1 — ¢)#™). Since ¢(m) = [Q(¢) : Q = efg, 
f =g=1, e = ¢(m) and (1 — ¢) has to be a prime ideal p. 


cr are both units in Ox. Now returning 


Corollary 8.8. Given a set S = {pi,...,p,} of distinct primes, there is an 
extension K/Q, such that p ramifies in K if and only if p€ S. 


Proof. Take K to be the composite K,---K;, of Kj = Q(G,)- 


Example 8.9. We illustrate Theorem 8.7 with two examples 


(1) Let m = 2?, so that ¢ = ¢4 = e?7/4 = i = V—T. Then K = Q(¢) = Q(i) 
and [K : Q| = ¢(m) = 2. Further, Ox = Z@ Zi = Z|i], the Gaussian integers. 
We have 
1 i 


x =|; -i 


By Theorem 8.7, 2 is the only prime to ramify in K and (2) = (1 —7)?. This 
can be checked directly: In Z|], the ideal 


(1-4)? = (1-4) = (-28) = @). 
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(2) Let m= 3, C=w = e?*/3, K = Q(w), [K : Q| = $(3) = 2. We have 


(Q—w)? =((1-—w)”) = +0? — Qu) 
= (—w — 2w) = (—3w) = (3). 


EXERCISES 


1. Suppose m = p” (p odd, r > 1). Show that Gal(Q(¢,)/Q) is cyclic 
of order (p—1)p"~". 


2. If m = 2" (r > 3), show that Gal(Q(Gn)/Q) is a direct product 
Gi x G2, where G; = {1, complex conjugation } and |G2| = p"~?. 


Theorem 8.10. Let m > 2 be an integer such that either m is odd or 4|m. 
Then a prime p ramifies in K = Q(Gm) if and only if p|m. 


Proof. If p is ramified, p|dx. But by Corollary 8.5, dx divides m™. Hence 
p\m. 


Conversely, let p|m. To show that p is ramified in K, it is enough to find a 
subextension k of K in which p ramifies. For this, we use Theorem 8.7. 


Case (1). If p is odd, take k = Q(¢,), and we are done. 


Case (2). If p = 2, then 4|m and (7/4 = V—1. We take k = Q(/—1). Then 
2 is ramified in k. 


Remark 8.11. The converse is not true when m = 2m’, with m’ odd. We 
leave it as an exercise to show that 2 is not ramified in Q(¢¢). 


Theorem 8.12. Let K = Q(a) be a number field of degree n over Q. The 
discriminant A(1,a,...,a@"~') of the basis 1,a,...,a"~1 of K over Q is given 
by 

A(1,a,...,a"~") = Nxja(f/(a)), (8.10) 


where f(x) is the irreducible polynomial of a over Q. 


Proof. Let G = Gal(Q(a)/Q) = {o1,...,on}. Then the discriminant 
A = A(l,a,...,a"~') of l,a,...,a"~+ is the square of the van der Monde 
determinant 

1 
oi(a) «s,s On (@) 
= [[(oi(a) - o;(a)). 
i<j 
o1(a"—+) On(a”—t) 
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Hence 
A= | | (ai(a) — o;(@)) 
ifj 
= II oi(] [la — 6; ‘9; (@))) 
i=1 jt 
= [oT] @- 200). (8.11) 
Now 


If we differentiate f(x) and evaluate at x = a, we get 


f(a) = [[ @-o(0)). 


ofid 


Hence by (8.11), 


A= TTots'(a)) 


= Nx/o(f'(a)). 


Proposition 8.13. [fq =p" > 2 1s a prime power, ¢ = Cq, then the discrim- 
inant 


a) ge?) 
A(1,¢,...,¢%9-4 = (- O_. (8.12) 
palP 
Proof. As before, 
xt — 1 
®,( ) = rdP—1 
is in Z[x]. Differentiate this expression for ®,(x) and evaluate at x = ¢ to get 
! = gga 
#,(¢) ie ¢4/P = 1° 
On taking the norms of both sides, we get 
(a) 
(—1) 2 q?) 
Nxja(®,(4)) (8.13) 


~ Nxjo(Gd/? — 1)" 
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Since 7 = ¢9/? is a primitive p-th root of unity, 


Nx/o(6"? — 1) = Nx/o(n- 1) = Nemye(Nxa(n- 1), 


i.e. 
Nxjo(64” — 1) = (Nein o(n — 1/0), (8.14) 
But 
— [K:Q) _ d(p") _q 
I: G0) = aay = 6@) ~p (8.15) 
and 
Nom/o(n-1) = [J G-¢) = lim ©,(«) = lim = =p. (8.16) 


a=1 


Hence, we get (8.12) from (8.10) and (8.13)—(8.16). 


Proposition 8.14. If qg = p" > 2 is a prime power, ¢ = Gq and K = Q(¢), 
then Ox =Z[C] =ZOZCO@---@ ZCH-1, 


Proof. Clearly, Z[¢] C Ox. So we only need to show that Ox C Z[¢]. We 
shall use the fact that w = 1-—¢ is a uniformizing parameter at p, where 


(p) = pe). 


It is obvious that Z[¢] = Z|w]. Hence it suffices to show that Ox C Z[w]. 
Since K = Q(¢) = Q(w), it follows that 1,w,...,w%(%—! is also a basis of K 
over Q. Therefore, if a € K, we have 


o(q)-1 o(q)-1 
a= ajw’ = ye b;¢? 
i=0 j=0 


with a;, 6; in Q. We need to show that a; € Z. 
Let G = Gal(Q(¢)/Q) = {o1,..., on} with n = $(q). Then 


n-1 
Gar = Se Bal C); $= Agena gm (8.17) 
j=0 


Solving the system (8.17) of n linear equations in n variables, bo,...,0n—1 by 
Cramer’s rule, we see that b; = %, where 7; € Ox andA = A(1,¢,...,¢"~"). 
By Proposition 8.13, A is a power of p. Therefore, for all j, pb; € Z and 
hence, pa; € Z, if m is sufficiently large. Let m > 0 be the smallest integer 
such that pa; € Z for all 7. We need to show that m = 0. Suppose m > 1. 
Then there is an index 7, 0 < i < n—1 such that pa; € pZ for j = 
0,1,...,i—1 but pa; ¢ pZ. 
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Since m > 0, p"a € pOx =w"O. Also for 7 = 0,1,...,4-—1, p™aj € pZ © 
w"Ox. Hence 


n-1 t—1 
B= ) paw? = p™a— ) paw? 
jai j=0 


is in w"Ox. (If i = 0, the second sum is absent.) Thus 
n-1 
pajw* = B- S- p™ajwi—* 
j=itl 
is in wtIOx. Hence p™a; is in wWONZ = pZ. This contradicts the choice of 
v 


Finally, we are in a position to compute the discriminant of an arbitrary 
cyclotomic field. Moreover, we shall show that if m > 2 is arbitrary, ¢ = Gn 
and K = Q(¢), then Ox = Z[C]. 


In the remaining section, unless stated to the contrary, let Ky, Ko be two 
number fields such that [K : Q| = [Ki : Q\[Ke : Q|, where K = KK 2. We 
denote by d; the discriminant of K; and by O,, the ring of integers of K; 
(j = 1,2). We recall (without proof) the following fact from Galois theory. 


Proposition 8.15. Given Q-isomorphisms 0; : K; + C (j = 1,2), there is 
a Q-isomorphism 0: K + C, such that oj, = 05. 


We denote by 01,02 the smallest subring of K containing both O; and Oz. 
It consists of all the finite sums of the form 


T1Yi +... + XUNYN (x; E O1,95 E Oz). 
We shall need the following. 


Proposition 8.16. Leta be the g.c.d of the discriminants d,, dz of Ky, Ko, 
respectively. Then aOx C O1O2. 


Proof. Let O; = Za, ®...®@ Zam and O2 = Zh, ®...6 ZBy with dy = 
Al Ops 14a) and dy = PN Ge eeee Gan: 


Since Q1,...,Q@m is a basis of Ky over Q, and §1,...,6n is a basis of Ko 
over Q, the condition [K,K2 : Q| = [Ay : Q|[K>2 : Q| implies that {a;4;} is a 
basis of Ky Ky over Q. If a € Ox, then surely, 


Ai 
a= De a a4 8; (8.18) 
tJ 


with a,j, bin Z and b > 0. We take b > 0 to the smallest common denominator 
of the coefficients of a;8;. Then 6 is coprime to at least one a;;. We need to 
show that for any a in Ox, this least common denominator bla = gced(d1, d2). 
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Let 01,...,0m be the m Q-isomorphisms of Ky into C. By Proposition 8.15, 
each o can be extended to a Q-isomorphism of Kk into C, whose restriction to 
Ko is the identity of Kz. We denote this extension by o also. We apply o = a; 
(r =1,...,m) to (8.18) to get 


or(a) => ~ o,-(a;). (8.19) 


We put 


Then the system of equation (8.19) becomes 
S/o, (ai)ei =o,(a), (r=1,...,m). (8.20) 
i=l 


Solving (8.20) by Cramer’s rule, we get c; = %, where y; € Ox and the 
rational integer 
A= A(aj,...,Q@m) = dh. 


This shows that fori =1,...,m, 


dic, = » dy a B; 

j=l 
is in Ox as well as in Ko, and hence in O02 = Ox 1 Ko. But O2 = ZG, @---@ 
Zn. Hence, d, 44 € Z, for all i, 7. Since b is coprime to at least one a;;, b/dt. 
Similarly, b|d2. Hence bla = gcd(dj, dz). 


Theorem 8.17 (Integral Basis for Cyclotomic Fields). Suppose m > 2 is any 
integer, C= Gn and K = Q(¢). Then 


Ox = Z{¢] =ZOZC@::: azZcrm-}, 


Proof. We use induction on the number s of prime divisors of m. If s = 1, 
the theorem has already been proved as Proposition 8.14. 


If s > 1, let m, =p” be the largest power of a prime p appearing in m. Let 
mz = m/m,. Then m = mymz2 with m1,m2 > 1 and ged(m1, m2) = 1. For 
j =1,2, let K; = Q(Gm,), dj = dx,;. By Corollary 8.5, ged(d,, dz) = 1. 


By induction hypothesis on s, O; = Ox, = Z[¢m,]. Since K = Q(Gm) = 
Kk, Ke and a = gcd(d1,d2) = 1, by Proposition 8.16, Ox C 0102 = Z[Gn]. 
Because the reverse inclusion, Z[¢] C Ox, is obvious, Ox = Z[Cn]. 
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Corollary 8.18. [fm > 1 is any integer and K = Q(Gm), then the discrimi- 


nant 
dx = (nee : mor) / [[ 2%"/@-?. 


p\m 


Proof. Let n = $(m) and G = Gal(K/Q) = {o1,...,on}, with o, = id. If 
€ = Gm, let wy = 6, wo = 02(C),..-,Wn = On(C) be the n conjugates of ¢. 


Since {1,¢,...,¢"~+} is a Z-basis of Ox, the discriminant dx is the square of 
the van der Monde determinant 
1 ae 1 twig wer 


o1(¢) «~~ On (€) 1 25, wee GE 


on(¢r-2) tai SOG od 1 ig Sale aa 


To complete the proof, we leave it as an exercise to compute the product 


[[@ i wy), 


V<j 


where w}1,...,Wy are all the primitive m-th roots of unity. 


EXERCISE 


(Kummer’s Lemma) Suppose m > 1 and K = Q(¢,,). If for u in C, & denotes 
its complex conjugate, show that u € Of implies that u/U € Jum, the group 
of m-th roots of unity. 


Hint: Let ¢ = Gm. Given o in Gal(K/Q), o(¢) = ¢* for some a. Since 


a(¢)=C4# = Ce o commutes with complex conjugation. Therefore, if a = u/U, 
then a € Of and |o(a)| = 1 for all o in G. Hence, a is a root of unity in K. 
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Let m > 2, ¢ = Gn and K = Q(C) be fixed. We now consider an arbitrary 
prime p with gcd(p,m) = 1. Then the Artin symbol (48 


element o of the Galois group G = Gal(K/Q) characterized by 
a(a) =a? (mod §), (8.21) 


) is the unique 


Vain Ox. In equation (8.21), $B is any prime of K dividing p. On the other 
hand, o is also uniquely determined by the value of o(¢), which is a primitive 
m-th root of unity and hence ¢% for some a € N with (a,m) = 1. Since 
Ox =Z[¢] and V a in Z, a? =a (mod p), condition (8.21) is equivalent to 


¢* =¢P (mod ). (8.22) 
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Furthermore, it follows at once from the following Proposition that the con- 
dition (8.22) is further equivalent to 


a=p (mod m). (8.23) 


Proposition 8.19. Suppose p and 8 are as above. Let a,b E N with 1 < 
a,b <™m, (a,m) = (b,m) = 1. If 


6° =¢" (mod §), 
then a= b. 
Proof. Suppose a > b. Then ¢* = ¢° (mod 8) & ¢7~° = 1 (mod $). Hence, 


it is enough to show that if0 < a < m—1, then ¢* = 1 (mod %) = a = 0. The 
implication < is trivial. To show that ¢* = 1 (mod 8) > a = 0, differentiate 


and evaluate at x = 1 to get 


m= TTo-o) 


Hence, if a > 0, then ¢* = 1 (mod $8) implies that $8]m, which implies that 
p|m. This contradicts our assumption that gcd(p,m) = 1. 


We summarize this discussion as the following important fact. 


Theorem 8.20 (Cyclotomic Reciprocity Law). The Artin symbol for the cy- 
clotomic field Q(Gm) is given by 


( 2Gn/@) a 


where the prime p is coprime to m. 


Corollary 8.21. Suppose p is any prime with (p,m) = 1. Choose the smallest 
integer f >1 such that pf =1 (mod m). Let ¥ be any prime in K = Q(Gn) 
dividing p. Then f = fp = f(B/p), and p factors into d(m)/f primes in K. 


Proof. The condition (p,m) = 1 implies that the inertia group T, = Ty) = 
{1} and the decomposition group 


Zy = Zx3/p = Gal((O/P)/(Z/pZ)), 
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which is a cyclic group of order f, generated by the Frobenius automorphism 
®, of O/B over F, = Z/pZ. Hence, its order is the smallest f > 1, such that 


&f = id, which is the case if and only if ®,(¢) = ¢”’ = ¢ © pf =1 (mod m). 
Hence f = fp. Since e, = 1, we have ¢(m) = fpgp and we are done. 


Corollary 8.22. A prime p, coprime to m, splits completely in Q(Gm) uf and 
only if p= 1 (mod m). 


Proof. The prime p with (p,m) = 1 splits completely in K = Q(¢m) = the 


Artin symbol (Scale) =1e¢@P=1ep=1 (mod m). 


Corollary 8.23. Let p1,p2 be two primes, both coprime to m, such that p, = 
p2 (mod m). Then py, p2 split into the same number of distinct prime factors 
in Kk. 


We may say that p,,p2 have the same splitting type in K. 


Proof. Let $8; be any prime divisor of p; in K, f; > 1 the smallest integer 
with p/i = 1 (mod m) for j = 1,2. It is given that py = py (mod m). Hence 
fi = fa, which implies that g, = 9% = 2) — gy. 


fo 


Example 8.24. We take m = 10, p = 7. The smallest f > 1 with 7f = 1 
(mod 10) is f = 4. Hence by Corollary 8.21, in Q(¢io), the prime 7 splits into 
(10)/4 = 4 primes. The same is true, by Corollary 8.23, for all the primes 
17,37, 47, 67,97,... 


Remark 8.25. By Dirichlet’s theorem on infinitude of primes in arithmetic 
progression, Corollary 8.23 implies the following general statement. 


Theorem 8.26. Given m > 2 and a prime p with (p,m) = 1, there are 
infinitely many primes, which split into the same number of primes in Q(Gm) 
as p does. 


In particular, there are infinitely many primes p (namely, those with p = 1 
(mod m)) which split completely in Q(¢m). 


EXERCISES 


1. Prove Remark 8.11. 
2. Complete the proof of Corollary 8.18. 
3. Specialize Theorem 6.19 to the cyclotomoic field Q(¢). 


Taylor & Francis 
Taylor & Francis Group 
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9 
The Kronecker-Weber Theorem 


This famous theorem asserts that every Abelian extension K of Q is a subfield 
of Q(Gn) for some m. These days, it is customary to obtain the Kronecker- 
Weber Theorem as a corollary to the main theorems of class field theory. How- 
ever, we shall give an elementary proof, based essentially on that of Hilbert. 


9.1 Gauss Sums 


We begin with a modest goal, namely, to show that every quadratic extension 
K = Q(vVd) with Gal(K/Q) © {+1}, which is Abelian, is cyclotomic, i.e. 
contained in Q(¢,,) for some m. The main tool is the Gauss sum. 


Let p be an odd prime and ¢ = ¢ = e?7*/?. Recall the Legendre symbol 


(s) for an integer a with gcd(a,p) = 1. We define (s) = 1 or —1 according 


as the congruence x? = a (mod p) has a solution or has no solution. In other 


words 
a\ Jl if a € Fx? 
p ~ )-1 — otherwise. 
It is convenient to set (s) = 0 if pla. Since ¢* depends only on the residue 


class of a (mod p), ¢® is well defined for a in F,. 


Definition 9.1. Suppose p > 2 is a prime. The element y = 7, of Q(G), 


defined by 
= (=) es (9.1) 


xeF, 


is called a Gauss sum. 


Theorem 9.2. For the Gauss sum y = yp, we have 


Y= (=) p- (9.2) 
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Proof. Since (2) = 0 for the zero element of F,,, 


-(Z@\(ZO)-E 0 


xeFp yCFp x,yeFp 


For a fixed x in F>, the map FF} 5 y > xy € F> permutes the elements of 
F*. Hence in (9.3), we can replace the sum over y by the sum over xy to get 


2 ay x+ny y x(1+y) 
or y Dp. @ - oP p ¢ (9.4) 
2,yeFp 2,yeFp 


=> (2) city) 4 (=) (p— 1). 


x,yck 
yF-1 


Because the sum 

L+O+---4+¢P"1 =0, 
for a fixed y £ —1, the sum Dnerx ¢*C+) is, up to rearrangement of terms, 
equal to ¢+¢? +---+¢?-! = —1. Hence, 


3 (“) cet =P (2) St catty 
xz,yeFp yA-1 e yeFp yA-1 a xeFp 


Gabe. 


yeFp 


l| 


The last sum is zero, because exactly half the elements of F¥ (p > 2) are 
squares. Hence (9.4) reduces to (9.2). 


It is convenient to introduce the following terminology. 


Definition 9.3. A number field K is called a cyclotomic field if K C Q(Gm) 
for some m > 1. 


Note that the composition of cyclotomic fields is a cyclotomic field. 
Corollary 9.4. The quadratic field K = Q(Vd) is a cyclotomic field. 
Proof. First, note that 


1: V2 € Q(¢s), 
2. J-Le Q(ds), 
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3. if p is an odd prime, then by Theorem 9.2, /p € Q(V—1, \/p) = 
Q(Ca4p). 


Hence, if we write the square-free integer d 4 0,1 as 


d= +2%,...p,  (r>0,a=0,1), 


where p1,...,pr are distinct odd primes, then Q(Vd) C Q(¢gm), with m = 
Pi-**Pr- 


9.2 Proof of the Kronecker-Weber Theorem 


We carry out the proof in a series of propositions and then put them together. 
We basically follow [19]. 


Proposition 9.5. If 2 is the only prime to ramify in a cyclic extension K/Q 
of degree 2™, then K is cyclotomic. 


For the sake of clarity, we break the proof into simple lemmas. 


Lemma 9.6. Suppose K CR is a cyclic extension of Q of degree 2™ (m > 1). 
If 2 is the only prime to ramify in K, then Q(V/2) is the unique quadratic 
subfield of Kk. 


Proof. Since K/Q is cyclic of degree 2™ (m > 1), it certainly contains a 
quadratic subfield L = Q(V/d), where d 4 0,1 is square-free. It is the fixed 
field of a subgroup of index two in Gal(/Q). By Minkowski’s theorem on 
discriminants, at least one prime, which can only be 2, must ramify in L. 
Hence 2 is the only prime divisor of its discriminant dz. Thus d = —1 or 2. 
But L CR. Hence L = Q(v2). 


Lemma 9.7. Suppose K CR is an Abelian extension of Q of degree 2”. If 2 
is the only prime that can ramify in K, then K/Q is cyclic. 


Proof. For m = 0,1, there is nothing to prove. We need to prove the lemma 
only for m > 2. If K/Q is not cyclic, then the Galois group G = Gal(K/Q) is 
a product 

G=C,x::-xC, (r > 2) 


of nontrivial cyclic groups C; of order 2%. We show that this leads to a 
contradiction. For simplicity, we take r = 2. (For r > 2, the proof is similar.) 
If K; is the fixed field of C;, then by Lemma 9.6, we have the Hasse diagram 
(cf. Figure 9.1) with K = kK, Ko. 
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FIGURE 9.1: Hasse diagram for K = Kk, K2 


Since m = m1, +z, the degree [K : Q] = [K, Ky: Q)] < 2™7t™-bagml. 
This is a contradiction. 


Lemma 9.8. Suppose K/Q is an Abelian extension of degree 2™, in which 2 
is the only prime that can ramify. Then K is the composite of K,, K2, where 
K, CR is cyclic and Ky = Q or Q(vi). 


Proof. If K C R, we are done (Lemma 9.7). If not, the complex conjugation 
7 is an element of the Galois group G = Gal(K’/Q) of order two and 


G =(r) x H, 


where H is a subgroup of G of order 2'~!. The fixed field Kp = K” of H 
is an imaginary quadratic field in which 2 is the only prime to ramify. Hence 
Kz = Q(i). The fixed field Ky of 7 is contained in R and is of degree 2”~?. 
Again 2 is the only prime that can ramify in K,. Hence by Lemma 9.7, K1/Q 
is cyclic. 


Proof of Proposition 9.5. We use induction on m. If m = 1, K is a quadratic 
field, hence it is cyclotomic. So suppose m > 1. First note that K contains a 
real cyclic subextension of Q of degree at least 2”~1. It is the fixed field of 
complex conjugation. Hence by Lemma 9.6, K D Q(V/2). 


Now let n = 2” and consider the subfield L = Q(C4n +C4,,) of Q(Can). Since 
L CR and 2 is the only prime to ramify in L, by Lemma 9.7, L/Q is cyclic. 
The degree [LZ : Q] of the cyclic extension L/Q is n = 2”. Hence by Lemma 
9.6, L > Q(V2). It is now clear from the Hasse diagram (Figure 9.2) that the 
degree of the composition KL over Q satisfies the inequality 


[KL : Q| <n?. (9.5) 
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KL 
ye 
K tF = L=Q(Can + Gan) 
mi | ont 
Q(v2) 
orn 
Q 


FIGURE 9.2: Hasse diagram for K L. 


By Proposition 7.17, the Galois group [ = Gal(K L/Q) is a proper subgroup 
of G x H, where G = Gal(K/Q), H = Gal(L/Q). Let o,7 be the generators 
of the cyclic groups G and H, respectively, which agree on KM L, so that 
(0,7) € I. The group A, generated by (0,7) has order n. Hence it follows 
from inequality (9.5) that the degree of the fixed field F = (KL)* of A over 
Q satisfies the inequality 

[Fe 2Q) = oma, 


The prime 2 is still the only prime to ramify in F. By Lemmas 9.7 and 9.8 
and the induction hypothesis on m, F is cylotomic, and therefore so is FL. 
Thus we will be done if we can show that FL = KL, since KE D K. But 
this is obvious, because only the identity automorphism in A restricts to the 
identity on L. 


Proposition 9.9. Suppose p is an odd prime and K is a cyclic extension of 
Q of degree p™. If p is the only prime to ramify in K, then K is cyclotomic. 


Proof. Let n = p™*+. The n-th cyclotomic field Q(¢,) has degree (p — 1)p™ 
over Q. Let L be the unique subfield of Q(¢,,) of degree p™ over Q. We show 
that kK = L. If not, then the compositum KL is Abelian of degree more 
than p™ over Q, and in KL, p is the only prime that ramifies. Hence by 
Theorem 7.24, KL is cyclic. But (by Proposition 7.17) Gal(KL/Q) has no 
element of order larger than p™. This contradiction proves that kK = L. 


In the next proposition, we remove the condition that | is the only prime 
to ramify in a cyclic extension of degree /” (J > 2, a prime) for the field to be 
cyclotomic. 
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Proposition 9.10. Let! > 2 be a prime and K/Q a cyclic extension of degree 
I™ (m>1). Then K is cyclotomic. 


Proof. We use induction on the number r of distinct primes p,...,pr 4 l 
that ramify in K. If r = 0, we are already done (Propositions 9.5 and 9.9). 
So suppose r > 1. Let p = p, and $8 a prime divisor of p in K. All subgroups 
and quotients of G = Gal(K/Q) are cyclic and their orders are power of I. 
Since (p,l) = 1, by Theorem 7.10, all the higher ramification groups of 8/p 
are trivial and by Theorem 7.16, the order /* of the inertia group T’ = Tyy/p, 
Le. 

IZ|= 2, 
divides p— 1. On the other hand, the cyclic extension Q(¢,)/Q of order p— 1 
contains a unique subfield LZ with 


and p is the only prime to ramify in L. (In fact, p is totally ramified in L.) 
So, the degree of the extension KL/Q is (cf. Figure 9.3) 


[KL:Q] =I" (b< a). 


If H = Gal(L/Q), then the Galois group Gal(K L/Q) is isomorphic to a sub- 
group of G x H. In fact the isomorphism is given by 


Gal(KL/Q) 37 > (qK,7\L) € GX A. 


gp’ KL K'L 
*\ Ze Qn) 
K p 
aa A 
xy K re 
rN %e 
p Q 


FIGURE 9.3: Hasse diagram. 


If $’ is a prime divisor of $B in KL and T’ = Ty), C Gal(KL/Q) is the 
inertia group, we first show that the order 


\T"| = 1°. (9.6) 
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Put $7 = OL. Then the order |T’| is the ramification index e(B’/p) and 
e(’/p) > e(Bxr/p) = p*, because p is totally ramified in LZ and [L : Q] = p*. 
So 

IT’) > 12. (9.7) 


The restriction of r in T’ to K maps T” into T, so T’ C T x H. The higher 
ramification groups of %8’/p are trivial, so being isomorphic to a subgroup of 
the cyclic group (Ox, /3’)*, T’ C Tx FH is also cyclic. Because |T'| = |H| = /*, 
no element of JT x H has order larger than /*. Hence 


In| <0. (9.8) 


Now (9.6) follows from (9.7) and (9.8). 


FIGURE 9.4: Another Hasse diagram 


Let K’ be the fixed field of T’ C Gal(KL/Q) (cf. Figure 9.4), and $B” = 
3B’ 1K’, then 8” is unramified over p. Further, 


K'NL=Q, 


because if we put k = kK’ L, then p is both unramified and totally ramified 
in k, which can happen only if k = Q. 


Now, because K’ is the fixed field of the subgroup T” of order /*, we have 
[KL : K'] =1*. Therefore, 


[KL :Q| =[KL: KK’: Q) =[*[K’ :Q. 
Also, because K’N L = Q, 
[K’L : Q) = [K’: QL: Q| = [K’ : QU’. 
Hence K’L = KL. 


Finally, p = p, is unramified in K’, and if a prime q # pj,...,pr—1,/, then 
q is unramified in K’. Further, K’/Q is cyclic of degree 1*. Thus, by induction 
hypothesis, K’ is cyclotomic, hence so is K’/L = KL D K. 
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Theorem 9.11 (Kronecker-Weber). If K is a finite Abelian extension of Q, 
then K is cyclotomic. 


Proof. By the Fundamental Theorem of Abelian groups, G = Gal(K/Q) is 
a direct product of cyclic groups of prime power orders, say 


G=C,x:-:xC,. (9.9) 


Let G; be the subgroup of G, obtained from (9.9) on replacing the j‘” factor 
C; by {1}. We put K; = K@:, the fixed field of G;. Then Gal(K;/Q) & Cj, 
so each K; is cyclotomic. But K is the compositum of K1,...,K,. Hence k 
is also cyclotomic. 


10 


Passage to Algebraic Geometry 


In this chapter, we indicate how the subject treated in the previous chapters 
leads naturally to the study of arithmetic algebraic geometry. 


Dedekind’s generalization (in 1870) of numbers and primes in Z to ideals 
and nonzero prime ideals in the ring Ox of integers in a number field K is 
a crowning achievement in number theory. Dedekind was a student of Gauss. 
Riemann, another student of Gauss, made (in 1859) probably the most famous 
conjecture in all of mathematics, called the Riemann Hypothesis. He assumed 
it in order to count the number of primes p < x, conjectured by Legendre and 
Gauss. The zeta function he used for this purpose had already been studied 
by Euler as a function 


(=> 5 (10.1) 


n=1 
of a real variable o. 


The series in (10.1) converges for o > 1. Riemann studied it as a function 
of a complex variable s = o + it, namely 


¢(s)= 0 ae (10.2) 


The series on the right of (10.2) converges for Re(s) > 1. He showed that the 

function defined by (10.2) for Re(x) > 1 extends by a functional equation to 

an analytic function on the whole complex plane C except for a simple pole at 

s = 1. The function ¢(s) is called the Riemann zeta function. The Riemann 
1 


Hypothesis asserts that all its nonreal zeros lie on the line o = 5. 


The series on the right of (10.2) has the Euler product 


s =-I(-3) » (10.3) 


The Dedekind zeta function ¢x(s) of a number field K is defined analogously 
by 
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the sum taken over all nonzero ideals a of Ox. This too has the Euler product 


Cx(s) = II (1 - <r) 7 (10.4) 


the product taken over all prime ideals 4 (0) of Ox. Clearly, it is a general- 
ization of the Riemann zeta function and counts the number of prime ideals 
pin Ox with N(p) < a, for x arbitrarily large. The Riemann Hypothesis 
can be extended to ¢x(s), for this count, to assert that its nonreal zeros have 
Re(s) = 3. 

During the earlier part of the 20th century it was realized that every zeta 
function counts something of interest to number theorists, if defined properly. 
For example, if we have a curve C' defined over a finite field F, of ¢ elements 
by an irreducible polynomial equation 


f(x,y) =0, (10.5) 


there are only finitely many points on C' with coordinates in the algebraic 
closure of F, of a “bounded size,” which can be counted by the zeta function 
¢a(s) of the curve C. 


For a clue on how to define ¢c(s), we write (10.5) as 
Gn (x)y” + +++ + ar(x)y + ao(z) = 0 (10.6) 


with a,;(a) in the field F,(x) of rational functions over F,, to get an extension 
K of F,() of finite degree, analogous to that of Q. 


Once it was realized by Ostrowski, Artin and others that the primes in Z, 
or more generally the prime ideals p in the ring Ox of integers of a number 
field K are in a one-to-one correspondence with (inequivalent) valuations on 
K, the Dedekind zeta function was defined entirely in terms of the valua- 
tions on K. The valuation on the function field F(x, y) = quotient field of 
Fy |x, y|/(f(x, y)) of the curve C' defined by (10.5) are given by the points on 
C. Thus to define the zeta function ¢¢(s) of the curve C and to propose the 
Riemann Hypothesis for ¢c(s), one appeals to the points on C. Later in the 
chapter, we will prove the Riemann Hypothesis for curves of genus 1 over F, 
and show that it is intimately related to counting points on these curves. 


Surprisingly, the Riemann Hypothesis for curves over finite fields has been 
shown to be true, by Hasse for curves of genus 1 and by Weil in general. 
Moreover, its generalization to varieties over finite fields (solutions of one or 
more polynomial equations in n(n > 2) variables over finite fields) conjectured 
by Weil, was proved by Deligne in 1974 for which he was awarded the Fields 
Medal. However, the original Riemann Hypothesis remains unproved. 


In view of the above discussion, in order to study the number fields and 
function fields of curves over finite fields as a single subject, by a global field 
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K it is meant that K is either a number field or the function field of a curve 
over a finite field. 


10.1 Valuations 


Suppose K is a field (not necessarily a global field). We denote the nonzero 
elements of kK by K™. A discrete valuation on K isa map vu: K* — Z, such 
that for all x, y in K%, 


L. u(ey) = 0(0) + vy) 
2. v(a+y) > min(v(z), v(y)). 


As a matter of convenience, we extend a valuation toa map v : K > ZU{oo} 
by setting v(0) = oo. Throughout this chapter we will exclude the trivial 
valuation given by the zero map. 


Two valuations on K are equivalent valuations if they can be scaled to give 
the same valuation. Among all the equivalent valuations, there is a unique one 
which is surjective. We use this normalized valuation as a representative of all 
valuations equivalent to it. 


Example 10.1. The p-adic valuation 


Let K = Q and p = 2,3,5,... a fixed prime. Any x 4 0 in Q has a unique 

representation 
a 
a Vp (x) = 

a: 
where ¢(b > 0) is in the lowest form, p and ab have no nontrivial common 
factor and vp is in Z. It is easy to check that vp is a valuation on Q, called 
the p-adic valuation. 


The p-adic valuations on Q can be extended in an obvious way to p-adic 
valuations on a number field K, p being a nonzero prime ideal of Ox. For a 
nonzero x in K write the principal fractional ideal (7) = « Ox generated by 
x as 


— pup (*) ki 
es aes 
with p not appearing in the unique factorization of a and 6 as products of 
powers of distinct prime ideals of Ox. It is a trivial consequence of Theorem 
3.69 (independence of valuations) that different prime ideals give inequivalent 
p-adic valuations. 
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Discrete Valuation Rings 
For a discrete valuation v on a field K, let 

O, = {a € K | v(x) > 0} 
and 

py ={x © K | v(x) > O}. 


Then O, is a ring, called the discrete valuation ring of v and p, is a prime 
ideal of O,. If the quotient O,,/p, is finite, we call the cardinality |Op/p,| the 
norm of the valuation v and denote it by Ny. 


In some situations, it is preferable to work with the multiplicative version 
of the discrete valuations on K, called the absolute values on K. 


Definition 10.2. An Archimedean absolute value on a field K is a map 
|| : & — [0,1) such that 


1. ja] =OS2z=0, 
2. |xy| = || |y| and 
3. Triangle Inequality holds: |a + y| < |x| + ly}. 


Clearly the restriction of || to K*, the group of nonzero elements of K, is 
a group homomorphism from K™* to the multiplicative group R™ of positive 
reals. 


Example 10.3. For z = «+ iy in C, |z| = 22+ y? is an Archimedean 
absolute value on C. If K is a subfield of C, it inherits this absolute value 
from C. The theorem of Gelfand and Tornheim [1] asserts that it is essentially 
the only Archimedean absolute values on K. 


Definition 10.4. A non-Archimedean absolute value on a field K is a map 
||: & — [0,1) with 


1. ja] =OSa2=0, 
2. |xy| = |x| |y| and 
3. a stronger inequality |a + y| < max(|z|, |y|) holds. 


By an absolute value we shall mean a non-Archimedean absolute value, 
unless to the contrary is clear from the context. If |x| = 1 for all x 4 0, 
the absolute value is trivial. It will be excluded from our discussion. Two 
absolute values ||; and ||2 are equivalent absolute values if for a constant 
c> 0, |z|2 = |x|¢ for all x in K. We can define a distance dist(x, y) between 
points x, y and K by dist(#,y) = |a — y|. The equivalent valuations thus 
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define the same topology on K. In fact, two absolute values are equivalent 
if and only if they induce the same topology. For the fields that concern us, 
there is a natural choice for the normalized absolute values on K. 


Example 10.5. 1. For kK = Q the p-adic absolute value is given by 
|z|p = pve) 


for x £0. 


2. If K is a number field, p a nonzero prime ideal of Ox, the p-adic absolute 
value on K is defined by 
2p = N(py-. 


In particular, if K = Q, N(pZ) = p and thus p-adic absolute value on K is a 
generalization of the p-adic absolute value on Q. 


The following result complements the theorem of Gelfand and Tornheim. 


Theorem 10.6 (Ostrowski). The only (inequivalent) non-Archimedean abso- 
lute values on a number field K are the p-adic absolute values on K. 


Proof. We prove it for K = Q and leave it as an exercise to extend it to 
a number field K. It is immediate by 2) of Definition 10.4 that | +1] = 1. If 
n > 0 is an integer, by 3), |n| = |1 +--- +1] < 1. Because || is nontrivial, 
|n| < 1 for some n ¥ O. In fact, |p| < 1 for a prime p, for otherwise by 2), | | 
would be trivial on Q. 


We now show that if g 4 p is another prime, then |g| = 1. Suppose |q| < 1. 
Choose integers a and b such that 1 = ap + bq. Then 


1 = |1| = lap + bg| < max((p|, |q|) < 1. 


This is a contradiction. 


Finally, put |p| = ¢ > 0. Then |2| = c’?), hence || is equivalent to | |p. 
Up to equivalence, there is only one Archimedean absolute value on a 
number field Q, the one inherited from the absolute value on C (Gelfand- 
Tornheim). It is convenient to denote it by |z|.. and regard it to be the one 


that corresponds to the infinite prime p = oo. 


Theorem 10.7 (Product Formula). For « £0 in Q, 
II [t|p - 1, 
P 


where p = co, 2,3,5,.... 
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Proof. The proof follows at once from the unique factorization in Q and the 
definition of ||, for p = 00, 2,3,5,.... 


Remark 10.8. The product formula extends to number fields in a fairly 
obvious way. 


Reformulation of Dedekind Zeta Function 
Let Vx be the set of all (inequivalent) non-Archimedean absolute values on a 
field K. For each v in Vx, we define two subsets of K by 

O, ={vE K | |a]y < 1} U {0} 


and 
po = {xe K | |z|, < 1} U {0}. 


It is easily seen that O, is a ring and p, a prime ideal of O,. Thus we can 
make the quotient ring O,/p,. If its cardinality |O,/p,| is finite, we let N, = 
|Ou/pu|, the norm of v. If K is a global field, this is always the case. In 
particular, if kK is a number field, we identify v with the corresponding prime 
ideal p of K. Then N, = |O/p| = N(p), the norm of p. So we can reformulate 
the Dedekind zeta function as 


¢x(s)= [J ce (10.7) 


10.2 Zeta Functions of Curves over Finite Fields 


We begin with a curve defined by an irreducible equation 


f(x,y) =0 (10.8) 
over any given field k. It is smooth or non-singular if the partial derivatives af 


and gi do not vanish simultaneously at any point on C, including the points 


at infinity on it. The genus g of such a curve is defined by 


(n — 1)(n — 2) 


; (10.9) 


I= 


where n = deg f(z, y). 


We assume all our curves to be smooth. Then the genus is 1 if and only 
if f(x,y) is a cubic. For the sake of simplicity in our exposition, we assume 
that in case k is a finite field, it is not an extension of Fz or F3. Then by a 
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suitable substitution for the variables, a curve of genus 1 having a point with 
coordinates in k is defined by a Weierstrass equation 


y=ax?+ar+b (a,b€k) (10.10) 
with the discriminant A = —16(4a? + 27b?) 4 0. 


For the so-called geometric case of global fields, one takes f(x,y) in (10.8) 
with coefficients in a finite field F, of a prime power g = p*(a > 1) elements. 
Then this global field is the function field of C, which by definition is the 
quotient field of the quotient ring F,[z, y]/(f(z, y)). It is an integral domain 


for f(a, y) is irreducible. If we have rational functions pew and joe where 
numerators and denominators agree modulo f(x,y), they define the same 
function on C’. Therefore, for each point P on C’, with coordinates in the 
algebraic closure Fy of F,, we can define a discrete valuation vp on K as 
follows: 
the order of zero of f at P 
up(f) = ¢ — the order of pole of f at P 


0 if P is neither a zero nor a pole of f. 


Since a rational function has only finitely many zeros and poles, vp(f) = 0 
for all except finitely many P. 


It is a standard fact from algebraic geometry that for any f in K, the number 
of its zeros is equal to the number of its poles, counted with multiplicities, i.e. 


S> up(f) =0 (10.11) 


PEC 


but not as stated here. Recall that the multiplicative version of (10.11), namely 
Theorem 10.7, is not valid until the absolute value ||, for the infinite prime 
p = oo is taken into account. Similarly, for (10.11) to hold, the summation 
above should include the points at infinity on C. 

To explain what the points at infinity on C’ are, one homogenizes its equation 
(10.8) by putting « = 4, y = in it and clears the denominators to get the 
homogeneous polynomial equation 


_ geeet p(X Y)_ 
F(X,Y,Z)=Z 1($.3) =0 (10.12) 
and require that (X,Y, Z) 4 (0,0,0). Then (X,Y, Z) is a solution = for each 
c #0, c(X,Y, Z) is a solution of (10.12). All such solutions are regarded as 
one point on C. The solutions to (10.8) are the points with Z = 1 and the 
points at infinity on C' are those with Z = 0. The curve C is a complete curve 
when the points at infinity on it are added to it. 


Example 10.9. Let E be the elliptic curve (by definition a curve of genus 1, 
given by its Weierstrass equation (10.10)). 
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By substituting « = %, y = ¥% in (10.10) and clearing the denominators 
y § Zr YU WA § 


from the resulting equation, we get the homogeneous equation 
Y?Z = X*°+axZ?+bZ 
of degree 3. If Z = 1, we get 
Y7=X*?+4axX +b (10.13) 


which is the same equation as (10.10). Thus we recover all the points on E 
defined by (10.10). If Z = 0, then X = 0. But since XYZ #0, Y #0, 
which may be taken to be 1. Hence F has only one point at infinity with 
homogeneous coordinates (0: 1 : 0). Moreover, (0: 1:0) = (0: +t: 0) fort 
however large we want, so (0: 1: 0) = (0: too : 0) is the point at infinity on 
FE. Intuitively, one may think of it as two points at infinity on either end of 
the y-axis. 


Singularities 
A point P on a curve C is singular point if all the partial derivatives 
OF OF. OF 
OX OY OZ 
vanish at P, i.e. we cannot write the equation of the tangent line to C at P. 


The curve C is non-singular or smooth if it has a tangent at every point on 
it, including at the points at infinity on it. 


Exercise 10.10. Show that the cubic defined by (10.10) is smooth if and 
only if #3 + ax + has distinct roots. [If a, b in a field k, assume that k is not 
an extension of F2 or F3.] 


There is another subtlety that needs to be addressed. If we allow, which we 
will, the points on C' to have coordinates in every finite field containing F, 
(the field of definition of C), different points on C can define the same vp. 


Example 10.11. To be concrete let us take gq = 3 and f(x,y) = y. Then 
Fs[x, y|/(f(@,y)) = F3[2] and the function field of the curve C defined by 


f(x,y) =0 


is kK = F3(x). Therefore, C is the x-axis. For the sake of completeness we 
include the single point on x-axis at infinity. Thus C is the projective line P* 
over F3. 


Since —1 is not a square in F3, the function f(2) = 22 +1 has no zero in F3. 
However, it factors as f(x) = (x +i)(x —i) in F3(i), i= /—1. There are thus 
two points P = (x) = (+7) on C with coordinates in the quadratic extension 
F3(z) of F3. For each of these points vp(f) = 1. a 
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In general, the discrete valuations on the function field K of a curve C 
are given by points on the (complete) curve C, but if a point P on C has 
coordinates in Fa but not in any smaller field, then there are d points P 
on C with the same valuation vp. For such points, N,, = q?. By definition, 
d = deg(P). Now we can say that (10.11) is valid if C is complete and the 
points P have coordinates in the algebraic closure F, of F,. 


Zeta Functions of Varieties over Finite Fields 


Let K be an algebraically closed field. A closed set in K” = K x---x K is 
KH 


n times 
the set X of solutions in K” to a finite number of polynomial equations 
y: 


fj(t1,---,%n) =0, g=l,...,m (10.14) 


with coefficients in K. If these coefficients are in a subfield k of kK, we say X 
is defined over k. A closed set V is a variety if it is irreducible, i.e. V is not a 
union V = X; U X2 of proper closed subsets X; of V. A variety V is smooth 
if at every point, it has a well-defined tangent plane. The dimension dim(V) 
of V is the number of free variables in (10.14) and it is projective if its points 
have coordinated in the projective space P”(K). 


Now let k = F, and K = F,, the algebraic closure of F,. Every element of 
F, is an element of a finite extension F,r of F, for some r > 1. Let N,(V) be 
the number of points of V with coordinates in Fyr. The zeta function of V 
has several equivalent formulations. The one we have been aiming for is 


cv(s)= [J (-2) (10.15) 


Vi being the set of (inequivalent) valuations on K, the function field of V. It 
is easily seen that 


— Nr Vv —s\r 

(v(s) =exp (> MW gm) i (10.16) 
r=1 

Following Weil, one puts t = q~* and writes (10.16) as 


Zy (t) = exp & WW) 


r=1 


In 1949, Weil [38] made the following conjectures about Z(t) after proving 
it when dim(V) = 1, i.e. for curves. 
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Weil Conjectures 

Let V be a smooth projective variety of dimension d over F,. Then 

1. Z(t) is a rational function of t of the form 


_ Py(t)P3(t) +++ Paa-i(t) 
Zu t) = RP) Paull)” 


where Po(t) = 1—t, Pog(t) = 1 — gq” and each P,(t) is a polynomial over Z 
factoring over C as P,(t) = [](1 — aijt). 
I 


2. If EF is the Euler characteristic of V, there is the functional equation 


1 
Zy (=z) x gre Zylt). 


3. Riemann Hypothesis. If Zy(t) = 0, then |a;;| = qi/? > Re(s) = 4. 


For elliptic curves (curves of genus 1) over finite fields, this was conjectured 
by E. Artin [1] in his thesis (1924) and proved in 1936 by Hasse. The rationality 
of the zeta functions of curves in general was proved in 1931 by F.K. Schmidt 
[34] whereas for varieties of higher dimensions, it was shown to be true by 
B. Dwork [14] in 1960. Finally, the last and the most difficult part of the Weil 
conjectures was proved by Deligne [12] in 1974, for which he was awarded the 
Fields Medal. The scheme theoretic algebraic geometry [20] was developed 
primarily for this purpose. 


We will discuss the Weil conjecture for the simplest nontrivial case (elliptic 
curves over finite fields) and prove the Riemann Hypothesis in this case. For 
a curve of higher genus, see [4], [35] and [36]. For Deligne’s proof of the Weil 
conjecture, see [16]. 


Projective Space P4@ 


Let V be the projective space P¢(F,) of dimension d with F,, the algebraic 
closure of F,. As an example, we compute its zeta function. But before doing 
so, let us be clear what it is. 


Let K be any field. The projective space P4(K) consists of nonzero points 
(z9,@1,...,2q4) of K+! with two such points «© = (20,21,...,%4), y = 
(yo: Y1,---,Ya) representing the same point of P¢(K) if y = ca for a nonzero 
cin K. In other words, # and y represent the same point P of P4(K) if their 
coordinates are proportional. For this reason, we write P = (a : 2: +++: Xa). 
If one such representative satisfies a homogeneous equation in d+ 1 variables, 
so do all other representatives of P. Thus it makes sense to talk about the 


Riemann Hypothesis for Elliptic Curves over Finite Fields 141 


solutions to a homogeneous polynomial equation in P¢(K). The set of solu- 
tions in P¢(K) to a finite number of homogeneous polynomial equations in 
d+1 variable is called a projective variety. In particular, P¢(K) is a projective 
variety defined by the zero polynomial (one with all its coefficients = 0). If the 
coefficients of these polynomials are in a subfield k of k’, we say the variety is 
defined over k. For a field L such that k C L C K, the set of points in P¢(L) 
of a variety V defined over k is denoted by V(L). 


Now let k= Fy, kK = Fy and V = P?. Then 


grat) 4 qd 
N-(V) = |V(Fqr)| = =i = ye 


Hence 


Applying the exp map to each side we get 


1 
(1—t)(1—qt)...(1— qt) 


Zy(t) = 


10.3. Riemann Hypothesis for Elliptic Curves over Finite 
Fields 


We denote by F the elliptic curve defined by its Weierstrass equation 
yi =x? +ar+b 


i.e. with a, b in Fy, ¢ a power of a prime p. 


As said, its zeta function Zz(t) has something to do with counting points 
on E£. To see what to expect, suppose the values of 2°? + ax + b are evenly 
distributed as x varies over Fj. When x* + ax + b = 0, we get one point 
of E. Because q is odd, half of the gq — 1 nonzero values of x? + ax + b are 
expected to be nonsquare giving no point of EF. For the other half, we have 
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(+y)* = 2? +axr+6 for some y in Fy, giving two points of F for each of the wt 
x in F,. Thus the expected number of points N, = |E(F,)| =1+2- at =4q. 
We define the integer 

Aq = q(E) =q— Ng (10.17) 


to be the deviation of N, from the expected number g of points on E(F,). It 
then turns out that (cf. [37, Prop. 12.1]) 


Zn(t) = an (10.18) 


In 1936, H. Hasse proved the estimate 


|aq(E)| < 2/4 (10.19) 


for ag(E), which is called the Riemann Hypothesis for elliptic curves over 
finite fields for the following reason. 


Recall the convention t = q~*. Thus if Zz(t) = 0, then q* is a root of the 
polynomial 
f(u) =w -—agu+g¢. 


The inequality (10.19) holds if and only if the discriminant a7 — 4q of f(u) is 
< 0, which is true if and only if the two roots u1, ug of f(u) are either real 
and equal, or are a pair of complex conjugates. Since the constant term q of 
f(u) is the product uzu2, (10.19) holds if and only if both roots of f(u) have 
absolute value ,/q, if and only if for all s with Zp(q~*°) = 0, |q*| = /q. This 
implies that Re(s) = 1/2. 


Broadly speaking, it is the geometric interpretations that have led to the 
proof of the Riemann Hypothesis, in algebraic geometry, while the case of 
Riemann’s original zeta function remains so intractable. 


An Elementary Proof of Hasse’s Theorem 


We now prove the Hasse theorem (inequality (10.19)), equivalently the 
Riemann Hypothesis for elliptic curves over finite fields. The proof is essen- 
tially that of Manin [26], which in itself is based on the original one by Hasse. 
To begin with let us assume that k is any field that does not contain F2 or F3 
as a subfield. Then, an elliptic curve E over k is defined by (10.10): 


y=xe+ar+b (a,b€k) 
with 4a? + 27b? # 0. 


If K is any field containing k, then the set E(K) consisting of points on it 
with coordinates in K together with its point O at infinity forms an Abelian 
group. For k = Q and K = R, assuming x? + ax +b has only one real root, it 
looks as in Figure 10.1. 
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Pi +P, 


FIGURE 10.1: Adding points on elliptic curves. 


As observed earlier, the point O at infinity is on each end of every vertical 
line. The sum of two points P,, P2 is the reflection in the x-axis of the third 
point P3 of intersection of the line through P, and Py» (tangent to (10.10) at 
P if P, = Pp = P) with the cubic (10.10). One then checks that O is the zero 
of the group, and that the inverse of a point (, y) is given simply by (a, —y). 


Twists 
To prove the inequality (10.19), we shall be working with another elliptic 
curve closely related to E. It is defined over the function field AK = F,(t) by 
dy? = 22> +ar+, (10.20) 
where \ = \(t) = t? + at +b. The elliptic curve E, given by equation (10.20) 
is a twist of E. 


If «(P) denotes the x-coordinate of a point P, we compute «(P; + P2) for 
P,, Py in E\(K) = {(a, y) € K? | Ay? = 2? + ax +b} U{O}. This formula for 
x(P; + P2) in terms of x(P,) and x(P2) plays a dominant role in the proof of 
inequality (10.19). We leave aside certain cases (such as z(P,) = x(P2); Pi or 
Pz =O) that we do not need for our proof. 


Suppose P; = (X;,Y;) € E,(K) for 7 = 1,2. To compute «(P; + P2) we 


goti 
write the equation of the line through P, and P2, which is 
Y, — Yo 
_ _ L 10.21 
y= (y=) e+ (10.21) 


To find the x-coordinate X3 of the third point P3 of intersection of this line 
with the cubic (10.20), we substitute for y from (10.21) in (10.20) to get 


1~ 
3 1 2 2 
x” —xX (= 2) x + 0. (10.22) 
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Since X,, X2, X3 are the three solutions of (10.22), the left side of (10.22) is 


(a X1)(x X2)(x X3) 
=a? = (X,+Xo+X3)27+---. (10.23) 


Comparing the coefficient of x? in (10.22) and (10.23), we get 


Way \ 
a(P, + Pz) = X3 =r ( ——* |) - (xX, + Xd). (10.24) 
X, — X2 
Frobenius map 


A crucial ingredient in the proof of inequality (10.19) is the Frobenius map 
® and its elementary properties. For a fixed q, let K be any field containing Fy 
as a subfield. We define 6 = 6, : K — K as the function given by ®(X) = X%. 


We summarize the properties of the Frobenius map. 


Theorem 10.12. The Frobenius map ®(X) = X% has the following proper- 
tres: 
i) (XY)¢ = X4Y4, 
ii) (X+Y)0= X44 4, 
iii) Fy = {ae K | ®(a) =a}. 
iv) For $(t) in F,(t), ®(¢(¢)) = O(t4). 

Although it is not used directly in this proof of the Hasse inequality, it is 
worth noting that iii) above implies that E(F,) consists precisely of points 
fixed by ®. Other proofs of the Hasse inequality use this fact directly. 

Proof. i) is trivial. 

ii) We use induction on r = log, q. Ifr = 1, q = p and 

(X+YpP=)0 ( ,) XfyP-5, 
j=0 


For 0 < j < p, the binomial coefficient (*) satisfies 


() em? ™ 


for some positive integer m, because nothing in the denominator can cancel p 
in the numerator and (7) is a whole number. Since pa = 0 for all a in K, ii) 
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follows. For r > 1, by the induction hypothesis 


(X+Y)0=((X+Y)""y 


= ae 4+yPr yp 
=XT+Y%, 


iii) The set F of nonzero elements of Fy is a multiplicative group of order 
q — 1. Therefore, by elementary group theory, a?~' = 1 for all a in oe 
In other words, each of the q elements of F, is a root of the polynomial 
t? —t = t(t?-! — 1) of degree g. Since a polynomial of degree q cannot have 
more than q roots, F, consists precisely of the elements of kK which are roots 
of t? — t. This proves iii). 


iv) follows at once from i), ii) and iii). 


Counting points on elliptic curves 


We now use the properties of the Frobenius map ®, and the group law 
on the elliptic curve E(K) to count the number of solutions of the equation 
y? =a +ar+b (a,beF,, ¢=p", 4a + 276? £0) with z, y in Fy. 


Clearly (t,1) and its negative —(t,1) = (¢t,—1) are in E)(K). Using the 
properties of ®,, it is also clear that the point 


Po = (t%, (2 + at + 0)-Y/) 
is in Ey(K). 


We define a degree function d, which we will show to be a quadratic poly- 
nomial with nonreal roots. Its discriminant plays a central role in the proof of 
inequality (10.19). For n € Z, let 


Ey = Py +n(t,1), 
the addition being the one on E)(K). Define d: Z > {0,1,2,...} by 


0, if P, =O; 
deg(num(a(P,,))), otherwise. 


d(n) = dy, = 


Here num(X) is the numerator of a rational function X € F,(t), taken in the 
lowest form. The values of this degree function on three consecutive integers 
satisfy (for a proof, see [10]) the following identity: 


Basic Identity 


dy tg = Od 4-9, (10.25) 
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The crux of the proof of (10.19) is the following theorem relating the degree 
function to the number N, of solutions of y? = x? + ax +b with 2, y in Fy. 


Theorem 10.13. 
d_j—dy-1=N,-4@. (10.26) 


Proof. Let X, = x«(P,). Since Po 4 (t,1), we have P_; 4 O, so d_; = 
deg(num(X_1)). We therefore compute X_; and look at the degree of its 
numerator when it is in the lowest form. By (10.24), 


(#8 + at + b)[(¢2 + at + b)G-Y/2 4+ 1)? 
(t9 — t)? 


- t?7+1 4+ lower terms 
aC 


X_1= 


(7 +1) (10.27) 


where the last expression is obtained by putting the previous one over the 
common denominator (t% — t)? and using property iv) of the Frobenius map. 
We must cancel any common factors in the last expression. Since the term 
t? +t has no denominator, it suffices to compute the cancellation in the first 
term of the previous expression. 


Property iii) of the Frobenius map is, as noted in its proof, equivalent to 
the fact that F, consists precisely of the g roots of t? — t. Hence 


t?-1= || (t-a), 
ack, 


so to compute d_, we wish to cancel all common factors of the fraction 


(€8 + at + b)[(42 + at + b)G-D/2 4 1)? 


I] ¢- oy 


ack, 


The only factors to cancel from the denominator of this quotient are either 
i) (t—a)? with (a3 + aa 4+ b)¢-Y/? = -1 or 
ii) t—awitha?+aa+b=0. 
[Recall that t? + at +b has no repeated root.] Let 


m = the number of factors of the first kind, 
mn = the number of factors of the second kind. 


Since factors of the first kind are coprime to the factors of the second kind, 


d_, = 2q+1-—2m-—n. 
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Since do = q, this gives 


d_,-—dg -1=q-—2m—n. (10.28) 


Now an a in F, with a® + aa +b equal to a nonzero square in F, will give two 
solutions of y? = x?+ax+b, whereas there is only one solution of this equation 
when a? + aa + b = 0. Moreover, Euler’s criterion says that a? + aa +b is a 
nonsquare if and only if (a® + aa + b)(@-))/? = —1, so m counts the number 
of a which do not correspond to any solution of y? = «° + ax + b. Hence 


Ng = 2q-—n-— 2m, 


or 
Ng-q=4-2m—n. (10.29) 


Equation (10.26) follows from (10.28) and (10.29). 


Theorem 10.14. The degree function d(n) is a polynomial of degree 2 in n. 
In fact, 


d(n) = n? — (d_1 — dy —1)n + dp. (10.30) 
Proof. By induction on n. For n = —1 and 0, (10.30) is a triviality. By the 
Basic Identity and the induction hypothesis, 
dn41 = 2dn — dn—1 +2 
= 2[n? — (d_1 — dy —1)n + do] 
(a — 1)? —(d_1 — dp — 1)\(n — 1) +p] +2 
= (n+ 1)? — (d_y — do —1)(n +1) + de. 


The induction step in the other direction can be carried out in a similar 
manner. 


Proof of the Riemann Hypothesis 
We consider the roots 71, x2 of the quadratic polynomial 
d(x) =2* —(N,-qx+4. 


Suppose that (10.19) fails to hold, so that the discriminant (Ng — q)? — 4q is 
positive. Then x1, £2 are distinct real numbers, say 71 < x2. By the way it is 
constructed, d(a) takes only nonnegative integer values on Z, so there must 
exist some n € Z such that 


nla <a cnt. (10.31) 
Since the coefficients of d(x) are in Z, we have x1 + 12, 41-2 € Z. Hence 


(a1 — az)” = (a1 + r2)° — 47179 € Z, 
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and for (10.31) to hold, we must have x; = n, x2 = n+ 1. But we note that 
“1X2 = q is a prime power, so this could only happen if gq = 2 and n = 1 or 
—2, which is a contradiction since we have assumed throughout that p ¥ 2. 
We thus conclude that (10.19) must hold, as desired. 


11 


Epilogue: Fermat’s Last Theorem 


11.1 Fermat’s Last Theorem 


This book began with the discussion of Fermat’s Last Theorem and it is 
appropriate to end it with the news that FLT is now a proven theorem, thanks 
to the work of numerous mathematicians. The strategy for this proof was 
suggested by G. Frey [17]. Based on some heuristic arguments, he concluded 
that if FLT is false for an odd prime @ > 5, ice. if 


a +vpo=c8 


holds for integers a, b, c where abc # 0, then the curve (now called the Frey 
curve) 
y” = «(x — a*)(4 + 0°) 


will have some properties which would assure it cannot exist. The so-called 
Taniyama-Shimura conjecture (1955-1957), which became widely known 
through a paper of Weil in 1967 asserts that every elliptic curve defined over 
Q is modular. In 1986, Ribet [30] proved that the truth of the Taniyama- 
Shimura conjecture for a certain class of elliptic curves (the semistable case) 
would be enough to imply FLT. It is this special case of the Taniyama-Shimura 
conjecture which in 1994 Sir Andrew Wiles proved (cf. [42]) to complete the 
proof of FLT. However, by the work of Breuil et al. [6], the Taniyama-Shimura 
conjecture is now a proven fact called the Modularity Theorem. 


To give a vague idea of the Taniyama-Shimura conjecture, recall [8, Ap- 
pendix] that an elliptic curve E'/C defined by its Weierstrass equation 


y? = 42° — gox — 93 


can be realized as a torus C/L for a suitable lattice L in C. The map ¢ : 
C/L =, E given by o(z) = (@(z), 9'(z)) parameterizes E by the Weierstrass 
g-function and its derivative, which are invariant under the translation by 
elements of L, ie. o(z + w) = e(z) for w in L. Similarly, a modular elliptic 
curve is an elliptic curve E'/Q that has something similar to do with the action 
of the modular group I'9(N) on the upper half plane h = {z € C | Im(z) > 0}. 
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By definition, To(V) is the group of 2 x 2 matrices 


a Ob 
u=(¢ a) 
over Z, of determinant 1 with c divisible by the integer N > 1. The aforemen- 
tioned action of [9(N) on § is given by the fractional linear transformations 


az+b 


zZ>Mz= : 
czt+d 


Roughly speaking, a modular function of weight k and level N is an analytic 
function f(z) on the upper half plane h with some growth conditions such 
that 
f(Mz) = (cz + d)* f(z) 
for all M in To(N). 
An elliptic curve E/Q has an equation 
y =2°+Ar+B (A,BEZ) (11.1) 


with its discriminant A = —16(4A? + 27b?) minimal. Its conductor N is a 
certain product of primes dividing A. The modularity conjecture states that 
the points on such an elliptic curve are parameterized by modular functions 
f(z), g(z) of weight 2 and level N. In other words, (11.1) may be written as 


(9(z))? = (f(z) P +A f(z) +B. 


11.2 An Alternative Approach to Proving FLT 


The following conjecture of Masser (1985) and Oesterlé (1988), also called 
the abc conjecture offers, apparently, a simpler approach for proving FLT for 
sufficiently large exponents. 


Suppose an integer n > 0 is a product of powers of distinct primes p1,..., pr. 
The radical of n is the squarefree integer rad(n) = p1... pr. 


The abc conjecture. Suppose for mutually coprime integers a, b, c with 
abc # 0 we have a+b =c. Given € > 0, there is a constant k = k(e) such that 


max/(|aJ, |b|, |c]) < k(e)rad(abe)***. 


Proof of FLT. If u, v, w is a nontrivial solution of «” + y” = z” with u, v, 
w mutually coprime, we may assume that max(|ul,|v|,|w|) = |w|. By the abc 
conjecture, 

|w|” < k(e)rad(|w))PO*9. 


This shows that FLT can fail only for a bounded set of exponents n. 
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direct product, 51 
Dirichlet’s unit theorem, 63 
discrete, 57 

discrete valuation, 36, 102, 133 
discrete valuation ring, 134 
discriminant, 15, 48, 50, 137 
discriminant of, 27 
distributive, 10 

divisor, 14 

divisors of zero, 21 

domain, 4, 13, 21 
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elliptic curve, 137 

equivalent, 37 

equivalent absolute values, 134 
equivalent valuations, 133 
Euclid, 1 

Euclidean domain, 13 

Euler ¢-function, 53 

Euler product, 131 


factor, 14 

field extension, 11 

finite A-module, 21 

finite field extension, 11 

finite-dimensional, 11 

finitely generated, 14, 21, 30 

finitely generated Z-module, 43 

First Isomorphism Theorem, 18 

fractional ideal, 35 

free, 14 

free A-module of rank, 23 

Frey curve, 149 

Frobenius map, 144 

full lattice, 57 
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fundamental theorem of arithmetic, 
30 

Fundamental Theorem of Geometry, 
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fundamental units, 65 


Galois extension, 12, 91 
Galois group, 12, 91 
general linear group, 9 
generators, 30 

genus, 136 

geometric case, 137 
geometric mean, 68 
global field, 132 

greatest common divisor, 37 
group, 9 

group law, 9 

group of roots of unity, 63 
group of units, 57 


homomorphism, 11 
homomorphism of A-modules, 23 
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integers, 2 

integral basis, 27 
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integrally closed, 33 
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inverse ideal, 35 

isomorphic, 11 


kernel, 17 
Krull dimension, 33 


lattice, 57 

least common multiple, 37 
Lebesgue measure, 59, 66 
level, 150 

linear map, 11 

linear transformation, 11 
local ring, 49 

localization, 49 
logarithmic imbedding, 62 


maximal ideal, 29 
minimal polynomial, 24 
Minkowski’s constant, 70 
Minkowski’s Lemma, 59, 69 
Minkowski’s theorem on 
discriminant, 101 
modular function, 150 
modularity conjecture, 150 
Modularity Theorem, 149 
module over, 10, 21 
monic, 3, 20 
monic polynomial, 43 
morphism, 11 
multiple, 14 
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multiplicative inverse, 9 
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natural numbers, 1 
nilpotent, 53 
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non-singular, 136, 138 
nonzero ideal, 29 

norm, 14, 136 

norm Euclidean, 14 

norm of the valuation, 134 
normal, 13 

normal closure, 44 

normal extension, 91 
normalized valuation, 37, 133 
number field, 4, 20 


object, 10 

obtained by adjoining, 19 
open ball, 57 

order, 4 

ordered basis, 23 


Pell equation, 16, 57 

points at infinity, 137 

prime divisors, 45 

prime element, 14 

prime ideal, 29 

primes, 1 

primitive Pythagorean triplet, 2 
principal, 14 

principal ideal, 30 

principal ideal domain, 4, 14, 50 
principal ideal ring, 30 

product, 9 

projective, 139 

projective space, 140 

projective variety, 141 

proper, 10 

proper ideal, 29 
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quadratic field, 20 
quotient, 11, 13, 29 
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ramified, 45 

ramifies, 15 

rank of the lattice, 57 

real imbedding, 60 

real quadratic field, 66 

reduced, 54 

regular, 4 

regular prime, 8 
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remainder, 13 

residue class degree, 45 

residue classes, 10 

Riemann Hypothesis, 131, 140 
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ring, 9 

ring homomorphism, 25 

ring of integers, 20 


scaling, 10 

semi-local ring, 50 
set theoretic difference, 49 
simple extension, 19 
singular point, 138 
smooth, 136, 138, 139 
solvable extension, 99 
spans, 11 

splits completely, 47 
stays prime, 47 
submodule, 10 
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totally ramified, 47 
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trivial, 134 

twin primes, 1 
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